POLYPEPTIDES OF THE "BASIC HELIX-LOOP-HELIX" BHLH 
FAMILY, CORRESPONDING NUCLEIC ACID SEQUENCES 

RELATED APPLICATIONS 

This application is a Continuation-in-Part of application 09/331,356 filed 
July 12, 1999, which is a 35 USC §371 application of PCT/FR97/02368 filed 
December 19, 1997. This application claims the benefit of foreign priority under 
35 U.S.C. § 119(a) to application FR96/15651 filed in France on December 19, 
1996. 

FIELD OF THE INVENTION 

The present invention relates to new bHLH-type polypeptides and to their 
corresponding coding nucleic acids. It also relates to expression vectors integrating 
said nucleic acids and to the use of these nucleic acids, vectors, or polypeptides for 
therapeutic purposes. 

BACKGROUND OF THE INVENTION 

The embryonic neural tube is at the outset composed of an 
undifferentiated neuroepithelium over its entire length. The cells then differentiate, 
depending on their positions on the anterior/posterior, dorsal/ventral and 
medial/lateral axes, thereby generating various neuronal structures. This 
differentiation is in part controlled by genetic factors that subdivide the neural 
tube into several histogenic regions. It is now known that numerous transcription 
factors and, more particularly, products encoded by the so-called bHLH ("Basic 
Helix Loop Helix") genes (Caudy et al., 1988, Cell, 55, 1061-67) are involved in 
this differentiation phenomenon. The proteins of this family have highly conserved 
amino acid motifs and more particularly a basic domain followed by Helix 1 and 
Helix 2. The latter two motifs are separated by a loop of variable size. In vertebrates, 
certain products of these bHLH genes participate in the various stages of 
neurogenesis during neuronal determination and differentiation. By way of 
representatives of these proteins, there may be mentioned in particular the Math-1 
(Akazawa et al., 1995, J. Biol. Chem., 270, 8730-38), Mash-1 (Johnson et al., 1990, 
Nature, 346, 858-61), and NeuroD (Lee et al., 1995, Science, 268, 836-44) proteins. 
Evidently, other proteins comprising this bHLH domain are involved in neuronal 
development at the level of other sites and at variable times. The identification of 
new bHLH-type proteins would be particularly valuable for increasing our 
understanding of neurogenesis and therefore be advantageous from the therapeutic 
point of view. 

Members of the basic helix loop helix (bHLH) transcription factor family 
play a key role in a wide variety of cell fate determination and specification 
processes. These processes are as diverse as cardiac muscle development, 
skeletal development [2], mesodermal cell determination [3], dermal cell 
differentiation and, more particularly, myogenesis and neurogenesis. 

Our understanding of the molecular basis of vertebrate nervous system 
development has benefited from the identification of bHLH factors analogous to 
those regulating neurogenesis in Drosophila. In the fly, proneural bHLH genes, 
such as the four genes of the achaete-scute complex (AS-C) and the atonal gene, 
are required for the fate determination of neuronal precursors within the 
ectoderm. The expression of proneural genes in Drosophila is restricted to the 
neural precursor cells by lateral inhibition mediated by the Notch signaling 
pathway [8, 9]. The search for vertebrate homologs of the Drosophila proneural 
genes has led to the identification of numerous genes encoding bHLH proteins. 
The AS-C homolog Mash1, the atonal homolog Math1 and the most recently 
identified genes of the neurogenin (ngn) family, including ngn1/Math4C, 
ngn2/Math4A and ngn3/Math4B/Relax, have temporal and spatial expression 
patterns consistent with a function in neuronal determination. The three 
neurogenin genes encode basic helix loop helix transcription factors. Functional 
analysis of these murine genes has shown that they play a key role in the 
determination of neuronal cell fate. In contrast, several other genes of the 
NeuroD family, including NeuroD/Beta2, Math2/Nex1 and Math3, are more likely to 
be involved in differentiation than determination. Functional analysis of these 
vertebrate genes has clearly suggested that the overall mechanism controlling 
neurogenesis is conserved throughout evolution at the fate determination and 
specification levels. The precise role in the developing nervous system of the 
putative neuronal determination genes has been investigated by gain-of-function 
and loss-of-function studies. The overexpression in Xenopus or zebrafish 
embryos of some of the latter genes converts non-neural ectoderm into neurons, 
suggesting that these genes encode neuronal determination factors. Analysis 
of null mutations of Mash1, ngn1 and ngn2 has demonstrated that these genes 
have true determination functions affecting the development of at least a subset 
of neuronal cells. 

SUMMARY OF THE INVENTION 

The subject of the present invention is precisely the isolation, sequencing 
and characterization of cDNAs encoding polypeptides that belong to a new family 
of bHLH-type proteins. It also describes recombinant DNAs incorporating these 
nucleic acids, vectors containing them, and their use for activating the expression 
of recombinant proteins. 

Applicants have now identified polypeptides possessing a high homology 
with the bHLH proteins already identified, a homology that is limited to the bHLH 
domain. Indeed, unexpectedly, the regions bordering this bHLH domain share no 
amino acid sequence identity with those of other bHLH-type proteins. 

Consequently, the first subject of the present invention is a bHLH-type 
polypeptide endowed with a transcriptional activity, characterized in that it shares 
amino acid sequence homology with the bHLH-type proteins only at the level of its 
bHLH domain. 

More precisely, the present invention relates to a polypeptide possessing a 
transcriptional activity, characterized in that it comprises completely or partly an 
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or a derivative thereof. 

Preferably, it is a bHLH-type polypeptide comprising an amino acid 
sequence represented in either one of SEQ ID NOs: 8 or 10. In a preferred 
embodiment, the bHLH-type polypeptide comprises 214 amino acids (SEQ ID NO: 
8), for which the expression of the corresponding mRNA is detected in rat embryo 
in the form of longitudinal bands. This protein is designated hereinafter by the 
name Relax, for "Rat Embryonic Longitudinal Axis", and constitutes a polypeptide 
according to the invention. 

Comparison of the amino acid sequence of Relax with those of already 
available proteins has made it possible to establish a sequence correlation 
essentially with the bHLH-type proteins. Moreover, this homology has been 
located only at the level of the bHLH domain. The bHLH domain of Relax thus 
shares respectively 68%, 57% and 40% homology with the bHLH domains of the 
NeuroD, Math-1 and Mash-1 proteins. 

From tests of activity carried out with the Relax protein and more precisely 
discussed in the examples below, it is evident that this protein is capable of binding 
specifically to an E box sequence, that in cotransfection trials it behaves like a 
transcriptional activator, and that this activity is strictly dependent on the presence 
of an intact E box. All these observations demonstrate that the claimed Relax 
protein belongs to a new family of bHLH proteins. 

In another preferred embodiment, the bHLH-type polypeptide comprises 
214 amino acids (SEQ ID NO: 10), which is designated hereinafter as human 
neurogenin 3 (hngn3) and constitutes a polypeptide according to the invention. 

The subject of the present invention is also a nucleic acid that encodes a 
bHLH-type polypeptide according to the invention. 

More precisely, the present invention relates to a nucleic acid comprising a 
polynucleotide sequence, characterized in that it is represented completely or partly 
by SEQ ID NOs: 1 or 9, or a complementary polynucleotide sequence or derivative 
thereof. 

The claimed nucleic acids may be complementary DNA (cDNA), genomic 
DNA (gDNA) or RNA fragments. They may be more particularly cDNA or gDNA. 

More preferably, it is a nucleic acid comprising a polynucleotide sequence 
of either one of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide 
sequence thereof. 

The present invention is also intended to cover corresponding antisense 
nucleic acids whose expression makes it possible to control the transcription of 
cellular mRNAs. Such nucleic acids may comprise all or part of a nucleic acid 
polynucleotide sequence according to the invention, transcribed in the reverse 
orientation. They are complementary or significantly complementary to the 
polynucleotide sequence of a nucleic acid according to the invention, either in total 
or in part. 

The subject of the present invention is also the use of the polypeptides of 
the invention to control and/or participate in the expression of genes. These 
transcriptional activators may be most particularly advantageous for targeting the 
expression of a protein at the level of the central nervous system. 

The polypeptides of the invention may be obtained by expressing, in a 
cellular host, a nucleic acid as described above, incorporated or otherwise into a 
recombinant DNA, using the techniques known to persons skilled in the art, or by a 
combination of these techniques. 

Thus, the invention relates to a method for the production of a 
polypeptide according to the invention, wherein said method comprises the steps 
of: 

a) inserting a nucleic acid encoding said polypeptide into an appropriate 
vector; 

b) culturing, in an appropriate culture medium, a previously transformed 
host cell or transfecting a host cell with the recombinant vector of step a); 

c) recovering the conditioned culture medium or lysing the host cell, for 
example by sonication or by osmotic shock; 

d) separating and purifying said polypeptide from said culture medium or 
alternatively from the cell lysates obtained in step c); and 

e) where appropriate, characterizing the recombinant polypeptide produced. 

Preferably, the nucleic acids according to the invention form part of a 
vector useful for inducing in vivo, ex vivo and/or in vitro the expression of the 
claimed polypeptides. The vector used may be of various origins, provided that it is 
capable of transforming animal cells, preferably human nerve cells. In a preferred 
embodiment of the invention, a viral vector, which may be derived from 
adenoviruses, retroviruses, adeno-associated viruses (AAV), the herpesvirus, the 
cytomegalovirus (CMV), the vaccinia virus, and the like, is used. 

The present invention therefore also relates to any recombinant virus 
comprising, inserted into its genome, a nucleic acid according to the invention. 
Preferably, the recombinant virus according to the invention is a replication 
defective virus. 

It is particularly advantageous to use the nucleic sequences of the invention 
in a form incorporated into an adenovirus, an AAV or a recombinant retrovirus that 
is defective. According to a preferred embodiment, it is an adenovirus. The vector, 
nucleic acid or polypeptide of the invention can be in an isolated or purified form. 

The subject of the present invention is also a pharmaceutical composition 
comprising at least one nucleic acid, vector or polypeptide according to the 
invention and a pharmaceutically compatible excipient. 

It also relates to any use of a nucleic acid, recombinant vector, or 
polypeptide according to the invention for the manufacture or preparation of a 
pharmaceutical composition or medicament intended for the treatment of a disease, 
condition, or pathology affecting the nervous system. 

The present invention also provides nucleotide probes and primers that 
hybridize with a nucleic acid sequence of a nucleic acid according to the 
invention. The nucleotide probes or primers according to the invention comprise 
at least 8 consecutive nucleotides of a nucleic acid comprising a polynucleotide 
sequence of either one of SEQ ID NOs: 1 or 9, or of a complementary 
polynucleotide sequence. Preferably, nucleotide probes or primers according to 
the invention will have a length of at least 10, 12, 15, 18, 20 to 25, 35, 40, 50, 
70, 80, 100, 200, 500, 1000, or 1500 consecutive nucleotides of a nucleic acid 
according to the invention. 
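
By way of illustration only, the minimum-length criterion for a probe or primer can be sketched in Python as follows. The target sequence is a made-up placeholder, not SEQ ID NO: 1 or 9, and the function names are hypothetical. The check uses exact string matching, which is a simplification of hybridization.

```python
# Illustrative sketch only. The function names are hypothetical and the
# target is a placeholder sequence, not SEQ ID NO: 1 or 9.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_candidate_probe(probe: str, target: str, min_len: int = 8) -> bool:
    """True if the probe has at least min_len nucleotides and is identical to
    a run of consecutive nucleotides of the target or of its complement."""
    probe, target = probe.upper(), target.upper()
    if len(probe) < min_len:
        return False
    return probe in target or probe in reverse_complement(target)


target = "ATGGCGCCTCATCCCTTGGATGCGCTCACC"  # placeholder sequence

print(is_candidate_probe("GCGCCTCATC", target))                      # sense strand
print(is_candidate_probe(reverse_complement("CCCTTGGATG"), target))  # complementary strand
print(is_candidate_probe("ATGGC", target))                           # shorter than 8 nucleotides
```

The first two calls accept 10-nucleotide probes taken from either strand; the third rejects a 5-nucleotide fragment because it falls below the 8-nucleotide minimum.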

The present invention also relates to a method of amplifying a nucleic acid 
according to the invention contained in a sample, wherein said method comprises 
the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is 
suspected into contact with a pair of nucleotide primers whose hybridization 
position is located respectively on the 5' side and on the 3' side of the region of the 
target nucleic acid whose amplification is sought, in the presence of the reagents 
necessary for the amplification reaction; and 

b) detecting the amplified nucleic acids. 
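
The positioning of the primer pair on the 5' side and on the 3' side of the target region can be sketched in Python as below. This is a simplified illustration that assumes perfect primer matches; the template, the primers and the function names are hypothetical placeholders and do not come from the sequences of the invention.

```python
# Illustrative sketch only: exact-match primer search on a placeholder template.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def predicted_amplicon(template: str, forward: str, reverse: str):
    """Return the region of the template bounded by the primer pair,
    or None if either primer has no exact binding site."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the strand shown; on that strand its
    # binding site reads as the primer's reverse complement.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]


template = "TTACGGATCCATGGCGCCTCATCCCTTGGATGCGCTCACCGAATTCAA"  # placeholder
forward = "GGATCCATGG"   # hypothetical primer on the 5' side
reverse = "GAATTCGGTG"   # hypothetical primer on the 3' side

amplicon = predicted_amplicon(template, forward, reverse)
print(amplicon, len(amplicon))
```

Real primer design also has to account for mismatches, melting temperature and secondary structure, none of which are modelled here.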

The present invention also relates to a method of detecting the presence of a 
nucleic acid according to the invention in a sample, wherein said method comprises 
the steps of: 

1) bringing one or more nucleotide probes according to the invention into 
contact with the sample to be tested; 

2) detecting the complex which may have formed between the probe(s) and 
the nucleic acid present in the sample. 

Another subject of the invention is a box or kit for amplifying all or part of 
a nucleic acid according to the invention, wherein said box or kit comprises: 

1) a pair of nucleotide primers in accordance with the invention, whose 
hybridization position is located respectively on the 5' side and 3' side of the 
target nucleic acid whose amplification is sought; and optionally, 

2) reagents necessary for an amplification reaction. 

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention; and 

b) where appropriate, reagents necessary for a hybridization reaction. 

The present invention also relates to a method of screening for antagonists 
or agonists of the polypeptides according to the invention. Preferably, the method 
of screening for antagonists or agonists of the polypeptides according to the 
invention is directed toward identifying an antagonist or agonist of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NO: 8 or SEQ ID NO: 
10. 

In a specific embodiment, the present invention relates to a method of 
identifying a modulator, agonist, or antagonist of a polypeptide according to the 
invention in a sample comprising 

a) obtaining a cell, for example a cell line, that, either naturally or after 
transfecting the cell with a nucleic acid encoding a polypeptide according to the 
invention, expresses a polypeptide according to the invention, 

b) transfecting the cell with a nucleic acid encoding a marker gene, such as 
β-galactosidase (β-gal), 

c) incubating the cell of step b) with a test solution or sample comprising a 
potential modulator, agonist or antagonist, 

d) measuring the amount of β-galactosidase activity, and 

e) comparing the amount of β-galactosidase activity measured in step d) 
with an amount of β-galactosidase activity measured with a cell that has not been 
previously incubated in the presence of the candidate modulator, agonist, or 
antagonist compound for the polypeptide according to the invention. 
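
The comparison of steps d) and e) can be sketched in Python as follows. This is a minimal illustration, not a prescribed analysis: the 1.5-fold threshold, the labels and the function name are assumptions introduced here, and the activity values are invented placeholders in arbitrary units.

```python
# Illustrative sketch only. The threshold, labels and readings are assumptions.
from statistics import mean


def classify_compound(treated, untreated, threshold=1.5):
    """Compare reporter activity of treated cells with that of untreated
    cells and return (fold change, label)."""
    if not treated or not untreated:
        raise ValueError("both groups need at least one measurement")
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated activity must be positive")
    fold = mean(treated) / baseline
    if fold >= threshold:
        label = "candidate agonist"
    elif fold <= 1 / threshold:
        label = "candidate antagonist"
    else:
        label = "no clear effect"
    return fold, label


# Placeholder reporter readings (arbitrary units).
print(classify_compound([30, 32, 28], [10, 10, 10]))
print(classify_compound([4, 5, 6], [10, 10, 10]))
print(classify_compound([10, 11, 9], [10, 10, 10]))
```

In practice the cut-off would be set from replicate variability and appropriate controls rather than fixed in advance.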

The present invention also relates to an antibody directed against a 
polypeptide according to the invention. Preferably, the antibody of the invention is 
directed against a polypeptide comprising an amino acid sequence of either one of 
SEQ ID NO: 8 or SEQ ID NO: 10. 

The present invention relates to a method of detecting the presence of a 
polypeptide of the invention in a sample comprising the polypeptide. The present 
invention also relates to a kit for detecting the presence of a polypeptide of the 
invention in a sample comprising the polypeptide. 

The present invention relates to a method of detecting the presence of a 
nucleic acid according to the invention in a sample comprising the nucleic acid. 
The present invention also relates to a kit for detecting the presence of a nucleic 
acid according to the invention in a sample comprising the nucleic acid. 

The present invention also relates to a composition comprising a nucleic 
acid encoding a polypeptide according to the invention, wherein the nucleic acid is 
placed under the control of appropriate regulatory elements. 

Thus, the present invention also relates to a pharmaceutical composition 
comprising a nucleic acid, polypeptide, or recombinant vector according to the 
invention, combined with one or more physiologically compatible vehicles and/or 
excipients. 

The invention relates to an isolated recombinant host cell comprising a 
nucleic acid of the invention. 

According to another aspect, the invention also relates to an isolated 
recombinant host cell comprising a recombinant vector according to the invention. 
Therefore, the invention also relates to a recombinant host cell comprising a 
recombinant vector comprising a nucleic acid of the invention. 

The present invention also relates to the use of isolated cells genetically 
modified ex vivo with a nucleic acid or recombinant vector according to the 
invention, or of isolated cells producing a recombinant vector, wherein the cells are 
implanted in the body, to allow a prolonged and effective expression in vivo of a 
biologically active polypeptide according to the invention. 

Thus, the invention also relates to the use of a recombinant host cell 
according to the invention, comprising a nucleic acid encoding a polypeptide 
according to the invention for the manufacture or preparation of a pharmaceutical 
composition or medicament intended for the prevention of, or more particularly, 
for the treatment of subjects affected by a nervous system pathology, dysfunction, 
disorder or condition. 

The invention relates to the use of a defective recombinant virus according 
to the invention for the preparation of a pharmaceutical composition intended for 
the treatment and/or for the prevention of a disorder or condition affecting the 
nervous system. Thus, the present invention also relates to a pharmaceutical 
composition comprising a defective recombinant virus according to the invention, 
combined with one or more physiologically compatible vehicles and/or excipients. 

The present invention also relates to the use of recombinant host cells 
genetically modified ex vivo with a recombinant defective virus according to the 
invention, or of recombinant host cells producing such viruses, implanted in the 
body, allowing a prolonged and effective expression in vivo of a biologically active 
protein of the invention. A specific embodiment of the invention is an isolated 
mammalian cell infected with one or more defective recombinant viruses according 
to the invention. 

Another subject of the invention relates to an implant comprising isolated 
mammalian cells infected with one or more defective recombinant viruses 
according to the invention or cells producing recombinant viruses, and an 
extracellular matrix. More particularly, in the implants of the invention, the 
extracellular matrix comprises a gelling compound and optionally, a support 
allowing the anchorage of the cells. 

Finally, the invention extends, in addition, to any use of a polypeptide, 
nucleic acid, recombinant vector, recombinant virus, recombinant host cell, or 
implant according to the invention to prevent and/or treat a dysfunction, disease, 
condition, or pathology affecting the nervous system or to control, influence, 
and/or participate in the expression of a gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Nucleotide sequence of the cDNA CIG235. The various oligonucleotides 
used to isolate the complete cDNA are indicated on this sequence. The arrows 
indicate the 5' to 3' orientation of these oligonucleotides. 

Figure 2: Comparison of the amino acid sequences of the bHLH domain of Relax 
with other proteins of the family. The homologous amino acids are noted in black, 
and the amino acids of similar structure in grey. 

Figure 3: Sequence of the hngn3 gene. The hngn3 coding region is indicated by 
capital letters. The deduced amino acid sequence is presented above the 
nucleotide sequence. Both nucleotide and amino acid numbers are indicated on 
the left. The 138 bp PCR product amplified from the basic and helix II domain is 
underlined. The black arrow marks the 5' end of the 3'-anchored PCR product 
whereas the gray arrow indicates the 3' end of the 5'-anchored PCR product. 
The intron sequence interrupting the 5' untranslated region is boxed in gray. 

Figure 4: The amino acid sequences encoded by the hngn3, ngn3, relax, ngn1, ngn2 
and neuroD3 genes were aligned using the Clustal-X program. The identity 
between the amino acid residues is displayed in color according to Clustal-X 
program parameters. (*) indicates residues conserved in all sequences. 

Figure 5: Optimal alignments obtained with the Clustal-X program were 
subjected to phylogenetic analysis with distance methods (Neighbor-Joining). 
The resulting trees correspond to the alignment of the bHLH domains only (a) or 
of the entire amino acid sequences (b). Bootstrap values are indicated near each 
branch. (a) Five paralogous groups highlighted on the right were clearly 
identified. This result is supported by the bootstrap value boxed in gray. (b) The 
phylogenetic relationships within the neurogenin family define three paralogous 
groups. All known members of the neurogenin family are represented in this tree. Gene 
names according to the original publications are indicated in capital letters. The 
species from which the genes were isolated are indicated as a lower case prefix: 
dm = Drosophila melanogaster; ce = Caenorhabditis elegans; dr = Danio rerio; 
xl = Xenopus laevis; gg = Gallus gallus; mm = Mus musculus; rn = Rattus 
norvegicus; hs = Homo sapiens. GenBank accession numbers are as follows: 
dmDA = J03148; dmT8 = X52892; dmT5 = M17120; dmT4 = M17119; dmT3 
= X12549; ceLIN32 = U15418; dmATONAL = L36646; drATH1 = AF024536; 
ggGATH1 = U61149; mmMATH1 = D43694; hsHATH1 = U61148; mmMath3 
= D85845; drNEUROD = AF036148; mmMATH2 = D44480; mmNEUROD2 = 
U58771; mmNEUROD = U28068; hsNEUROD = D82347; dmTAP = X95845; 
xlNGNR1A = U67778; mmNGN2 = U76207; ggNGN2 = AJ012659; drNGN1 = 
AF017301; ggNGN1 = AJ012660; mmNGN1 = U67776; rnNGN1 = U67777; 
hsNEUROD3 = U63842; mmNGN3 = U76208; rnRELAX = Y10619; 
hsHNGN3 = AJ133776; mmTWIST = M63649; mmSCLERAXIS = S78079; 
mmPARAXIS = U18658; mmEHAND = S79216; mmDHAND = U40039. 

Figure 6: The chromosomal location of hngn3 was determined by radiation 
hybrid mapping. Linkage analysis mapped hngn3 to 10q21.3, between the 
D10S1418 (GDB 684330) and D10S1603 (GDB 589266) markers (boxed in 
gray). A comprehensive map of chromosome 10 is shown on the left-hand side 
and a fine map of the region containing hngn3 is shown on the right-hand side. 
The maps were obtained using the mapview program of the genome data base 
(GDB) package. The name of each gene and marker and their GDB accession 
numbers are as follows: D10S1418 (GDB 684330); D10S1603 (GDB 589266); 
CHAT = choline acetyltransferase (GDB 119775); PAX2 = paired box homeotic 
gene 2 (GDB 138771); ANX7 = annexin VII (GDB 369042); PSAP = 
prosaposin (GDB 120366); PRG1 = platelet proteoglycan 1 protein core (GDB 
120312); GLUDP3 = glutamate dehydrogenase pseudogene 3 (GDB 137196); 
ANK3 = ankyrin 3 (GDB 424503); DNA2L = DNA replication helicase (yeast 
homolog)-like (GDB 1313702); MAT1A = methionine adenosyltransferase I 
alpha (GDB 129077); CAMKG = CaM kinase II gamma (GDB 138469); HK1 = 
hexokinase 1 (GDB 120044). 

Figure 7: Transverse sections of human embryos at Carnegie stage 18 were probed 
with an hngn3 antisense riboprobe (a), (c)-(h) and a sense probe (b). In the spinal 
cord (a), (b), (e)-(h), hngn3-positive cells are present in two longitudinal stripes (a) 
and (e). The ventral stripe (full arrow) is adjacent to the floor plate (fp). The medial 
stripe (arrow head) is located around the midpoint of the dorso-ventral axis. Both 
stripes are contained within the ventricular zone as shown by phase contrast 
microscopy (f). (g) High magnification of the ventral stripe. (h) High magnification 
of the medial stripe. hngn3 is expressed in scattered cells throughout the pancreas 
as shown on the transverse section of the left lobe of the pancreas. At higher 
magnification, the hngn3-positive cells appear to be organized as clusters (d). 
Scale bars: (a) and (b) 200 µm; (c) 100 µm; (e) and (f) 50 µm; (d), (g) 
and (h) 25 µm. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

The following defined terms are used throughout the present specification, 
and should be helpful in understanding the scope and practice of the present 
invention. 

In a specific embodiment, the term "about" or "approximately" means 
within 20%, preferably within 10%, and more preferably within 5% of a given 
value or range. 
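
As a worked illustration of this definition, the following Python sketch tests whether a measured value is "about" a reference value at the 20%, 10% or 5% level. The function name and the example numbers are hypothetical.

```python
# Illustrative sketch only; the function name and example values are hypothetical.
def is_about(value: float, reference: float, tolerance: float = 0.20) -> bool:
    """True if value lies within the given fractional tolerance of reference."""
    return abs(value - reference) <= tolerance * abs(reference)


print(is_about(118, 100))         # within 20% of 100
print(is_about(118, 100, 0.10))   # not within 10% of 100
print(is_about(104, 100, 0.05))   # within 5% of 100
```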

A "nucleic acid" is a polymeric compound comprised of covalently linked 
subunits called nucleotides. Nucleic acid includes polyribonucleic acid (RNA) and 
polydeoxyribonucleic acid (DNA), both of which may be single-stranded or 
double-stranded. DNA includes cDNA, genomic DNA, synthetic DNA, and 
semi-synthetic DNA. 

A "gene" refers to an assembly of nucleotides that encodes a polypeptide, 
and includes cDNA and genomic DNA nucleic acids. 

A "recombinant DNA molecule" is a DNA molecule that has undergone a 
molecular biological manipulation. 

A "vector" is any means for the transfer of a nucleic acid into a host cell. A 
vector may be a replicon to which another DNA segment may be attached so as to 
bring about the replication of the attached segment. A "replicon" is any genetic 
element (e.g., plasmid, phage, cosmid, chromosome, virus) that functions as an 
autonomous unit of DNA replication in vivo, i.e., capable of replication under its 
own control. The term "vector" includes both viral and nonviral means for 
introducing the nucleic acid into a cell in vitro, ex vivo or in vivo. Viral vectors 
include retrovirus, adeno-associated virus, pox, baculovirus, vaccinia, herpes 
simplex, Epstein-Barr and adenovirus vectors, as set forth in greater detail below. 
Non-viral vectors include plasmids, liposomes, electrically charged lipids 
(cytofectins), DNA-protein complexes, and biopolymers. In addition to a nucleic 
acid, a vector may also contain one or more regulatory regions, and/or selectable 
markers useful in selecting, measuring, and monitoring nucleic acid transfer results 
(transfer to which tissues, duration of expression, etc.). 

A "cloning vector" is a replicon, such as plasmid, phage or cosmid, to 
which another DNA segment may be attached so as to bring about the replication 
of the attached segment. Cloning vectors may be capable of replication in one cell 
type, and expression in another ("shuttle vector"). 

A "cassette" refers to a segment of DNA that can be inserted into a vector at 
specific restriction sites. The segment of DNA encodes a polypeptide of interest, 
and the cassette and restriction sites are designed to ensure insertion of the cassette 
in the proper reading frame for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when 
such DNA has been introduced inside the cell. A cell has been "transformed" by 
exogenous or heterologous DNA when the transfected DNA effects a phenotypic 
change. The transforming DNA can be integrated (covalently linked) into 
chromosomal DNA making up the genome of the cell. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a 
double-stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices 
are possible. The term nucleic acid molecule, and in particular DNA or RNA 
molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes 
double-stranded DNA found, inter alia, in linear or circular DNA molecules (e.g., 
restriction fragments), plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein 
according to the normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (i.e., the strand having a 
sequence homologous to the mRNA). A "recombinant DNA molecule" is a DNA 
molecule that has undergone a molecular biological manipulation. 
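
The strand convention described above can be illustrated with a short Python sketch: given the nontranscribed (coding) strand written 5' to 3', it derives the transcribed (template) strand and the corresponding mRNA. The nine-nucleotide sequence is a made-up placeholder and the function names are hypothetical.

```python
# Illustrative sketch only; the sequence is a placeholder and the names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def template_strand(coding: str) -> str:
    """Transcribed strand, written 5' to 3' (reverse complement of the coding strand)."""
    return coding.upper().translate(COMPLEMENT)[::-1]


def mrna_from_coding(coding: str) -> str:
    """mRNA sequence: same as the nontranscribed strand, with U in place of T."""
    return coding.upper().replace("T", "U")


coding = "ATGGCGCCT"  # nontranscribed strand, 5' to 3' (placeholder)
print(template_strand(coding))   # AGGCGCCAT
print(mrna_from_coding(coding))  # AUGGCGCCU
```

The mRNA reads the same as the nontranscribed strand apart from the T to U substitution, which is why that strand alone is conventionally reported.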

For the purposes of the present invention, the term "derivative" designates 
any sequence differing from the sequence considered as a result of a degeneracy of 
the genetic code, obtained by one or more modifications of a genetic and/or 
chemical nature, as well as any sequence that hybridizes with these sequences or 
fragments thereof and whose product possesses the activity indicated. 

Modification of a genetic and/or chemical nature is understood to mean any 
mutation, substitution, deletion, addition and/or modification of one or more 
residues. Such derivatives may be generated for various purposes, such as in 
particular that of enhancing their production levels, that of increasing and/or 
modifying their activity, or that of conferring new pharmacokinetic and/or biological 
properties on them. Among the derivatives resulting from an addition, there may be 
mentioned, for example, the chimeric nucleic sequences comprising an additional 
heterologous part linked to one end, for example of the hybrid construct type 
consisting of a cDNA with which one or more introns would be associated. 

Likewise, for the purposes of the invention, the claimed nucleic acids may 
comprise promoter, activating or regulatory sequences, and the like. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the 
nucleic acid molecule can anneal to the other nucleic acid molecule under the 
appropriate conditions of temperature and solution ionic strength (see Sambrook et 
al., supra). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. For preliminary screening for homologous 
nucleic acids, low stringency hybridization conditions, corresponding to a Tm of
55°C, can be used (e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30%
formamide, 5x SSC, 0.5% SDS). Moderate stringency hybridization conditions
correspond to a higher Tm, e.g., 40% formamide, with 5x or 6x SSC. High
stringency hybridization conditions correspond to the highest Tm, e.g., 50%
formamide, 5x or 6x SSC. Hybridization requires that the two nucleic acids
contain complementary sequences, although depending on the stringency of the
hybridization, mismatches between bases are possible. The appropriate stringency
for hybridizing nucleic acids depends on the length of the nucleic acids and the
degree of complementation, variables well known in the art. The greater the degree
of similarity or homology between two nucleotide sequences, the greater the value
of Tm for hybrids of nucleic acids having those sequences. The relative stability
(corresponding to higher Tm) of nucleic acid hybridizations decreases in the
following order: RNA:RNA, DNA:RNA, DNA:DNA. For hybrids of greater than
100 nucleotides in length, equations for calculating Tm have been derived (see
Sambrook et al., supra, 9.50-9.51). For hybridization with shorter nucleic acids,
i.e., oligonucleotides, the position of mismatches becomes more important, and the
length of the oligonucleotide determines its specificity (see Sambrook et al., supra,
11.7-11.8). Preferably a minimum length for a hybridizable nucleic acid is at least
about 10 nucleotides; preferably at least about 15 nucleotides; and more preferably
the length is at least about 20 nucleotides.
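The dependence of Tm on salt, base composition, formamide and hybrid length can be illustrated with a short calculation. The text above only refers to Sambrook et al. for the equations; the relation used below is a commonly cited empirical form for DNA:DNA hybrids longer than about 100 nucleotides, and its coefficients are an assumption to be checked against that reference. The function and parameter names are illustrative.

```python
import math

def tm_long_hybrid(na_molar, gc_percent, formamide_percent, length_nt):
    """Estimate Tm (deg C) of a long DNA:DNA hybrid.

    Assumed empirical relation (not stated in the source text):
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%G+C) - 0.63*(%formamide) - 600/L
    """
    if na_molar <= 0 or length_nt <= 0:
        raise ValueError("salt concentration and length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_nt)

# Hypothetical probe: 600 nt, 50% G+C, 1 M monovalent cation.
without_formamide = tm_long_hybrid(1.0, 50.0, 0.0, 600)
with_formamide = tm_long_hybrid(1.0, 50.0, 50.0, 600)
print(without_formamide, with_formamide)
```

Under this relation, adding formamide lowers the estimated Tm, which is consistent with running a high stringency hybridization in 50% formamide at a moderate incubation temperature.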

In a specific embodiment, the term "standard hybridization conditions"
refers to a Tm of 55°C, and utilizes conditions as set forth above. In a preferred
embodiment, the Tm is 60°C; in a more preferred embodiment, the Tm is 65°C.

"High stringency hybridization conditions" for the purposes of the present 
invention will be understood to mean the following conditions: 
1- Membrane competition and PREHYBRIDIZATION:

- Mix: 40 µl salmon sperm DNA (10 mg/ml)
  + 40 µl human placental DNA (10 mg/ml)

- Denature for 5 minutes at 96°C, then immerse the mixture in ice.

- Remove the 2X SSC and pour 4 ml of formamide mix in the hybridization tube
containing the membranes.

- Add the mixture of the two denatured DNAs.

- Incubation at 42°C for 5 to 6 hours, with rotation.

2- Labeled probe competition:

- Add to the labeled and purified probe 10 to 50 µl Cot I DNA, depending on the
quantity of repeats.

- Denature for 7 to 10 minutes at 95°C.

- Incubate at 65°C for 2 to 5 hours.

3- HYBRIDIZATION:

- Remove the prehybridization mix.

- Mix 40 µl salmon sperm DNA + 40 µl human placental DNA; denature for 5 min
at 96°C, then immerse in ice.

- Add to the hybridization tube 4 ml of formamide mix, the mixture of the two
DNAs and the denatured labeled probe/Cot I DNA.

- Incubate 15 to 20 hours at 42°C, with rotation.

4- Washes and Exposure:

- One wash at room temperature in 2X SSC, to rinse.

- Twice 5 minutes at room temperature 2X SSC and 0.1% SDS at 65°C.

- Twice 15 minutes at 65°C 1X SSC and 0.1% SDS at 65°C.

- Envelope the membranes in clear plastic wrap and expose.



The hybridization conditions described above are adapted to hybridization,
under high stringency conditions, of a molecule of nucleic acid of varying length
from 20 nucleotides to several hundreds of nucleotides. It goes without saying that
the hybridization conditions described above may be adjusted as a function of the
length of the nucleic acid whose hybridization is sought or of the type of labeling
chosen, according to techniques known to one skilled in the art. Suitable
hybridization conditions may, for example, be adjusted according to the teaching
contained in Hames and Higgins, 1985 (Nucleic acid hybridization: a practical
approach, Hames and Higgins Ed., IRL Press, Oxford) and Ausubel et al., 1989
(Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley
Interscience, N.Y.).

As used herein, the term "oligonucleotide" refers to a nucleic acid, 
generally of at least 18 nucleotides, that is hybridizable to a genomic DNA 
molecule, a cDNA molecule, or an mRNA molecule encoding a polypeptide of the
invention. Oligonucleotides can be labeled, e.g., with ³²P-nucleotides or
nucleotides to which a label, such as biotin, has been covalently conjugated. In one
embodiment, a labeled oligonucleotide can be used as a probe to detect the
presence of a nucleic acid encoding a polypeptide of the invention. In another
embodiment, oligonucleotides (one or both of which may be labeled) can be used
as PCR primers, either for cloning full length or a fragment of a polypeptide of the
invention, or to detect the presence of nucleic acids encoding a polypeptide of the 
invention. In a further embodiment, an oligonucleotide of the invention can form a 
triple helix with a DNA molecule of the invention. Generally, oligonucleotides are 
prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, 
oligonucleotides can be prepared with non-naturally occurring phosphoester analog 
bonds, such as thioester bonds, etc. 

A DNA "coding sequence" is a double-stranded DNA sequence that is 
transcribed and translated into a polypeptide in a cell in vitro or in vivo when 
placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a start codon at the 5' (amino) terminus and 
a translation stop codon at the 3' (carboxyl) terminus. A coding sequence can 
include, but is not limited to, prokaryotic sequences, cDNA from eukaryotic 
mRNA, genomic DNA sequences from eukaryotic (e.g., mammalian) DNA, and
even synthetic DNA sequences. If the coding sequence is intended for expression 
in a eukaryotic cell, a polyadenylation signal and transcription termination 
sequence will usually be located 3' to the coding sequence. 
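The boundaries of a coding sequence, as defined above, can be located computationally. The following is a minimal sketch under stated assumptions: it scans only the strand supplied, takes ATG as the start codon and TAA, TAG or TGA as stop codons, and returns the first start-to-stop stretch found. The function name and the example sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_coding_sequence(dna):
    """Return the first ATG-to-stop stretch read 5' to 3', or None."""
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        # Walk codon by codon in the frame set by this start codon.
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return dna[start:i + 3]
        start = dna.find("ATG", start + 1)
    return None

print(find_coding_sequence("CCATGGCTAAATAAGG"))
```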

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, terminators, and the like, that provide for
the expression of a coding sequence in a host cell. In eukaryotic cells,
polyadenylation signals are control sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding 
RNA polymerase in a cell and initiating transcription of a downstream (3'
direction) coding sequence. For purposes of defining the present invention, the
promoter sequence is bounded at its 3' terminus by the transcription initiation site
and extends upstream (5' direction) to include the minimum number of bases or
elements necessary to initiate transcription at levels detectable above background.
Within the promoter sequence will be found a transcription initiation site
(conveniently defined, for example, by mapping with nuclease S1), as well as
protein binding domains (consensus sequences) responsible for the binding of 
RNA polymerase. 

A coding sequence is "under the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence 
into mRNA, which is then trans-RNA spliced (if the coding sequence contains
introns) and translated into the protein encoded by the coding sequence. 

As used herein, the term "homologous" in all its grammatical forms and 
spelling variations refers to the relationship between proteins that possess a
"common evolutionary origin," including proteins from superfamilies (e.g., the
immunoglobulin superfamily) and homologous proteins from different species
(e.g., myosin light chain, etc.) (Reeck et al., 1987, Cell 50:667). Such proteins
(and their encoding genes) have sequence homology, as reflected by their high 
degree of sequence similarity. 

Accordingly, the term "sequence similarity" in all its grammatical forms
refers to the degree of identity or correspondence between nucleic acid or amino
acid sequences of proteins that may or may not share a common evolutionary
origin (see Reeck et al., supra). However, in common usage and in the instant
application, the term "homologous," when modified with an adverb such as 
"highly," may refer to sequence similarity and not a common evolutionary origin. 

In a specific embodiment, two DNA sequences are "substantially
homologous" or "substantially similar" when at least about 50% (preferably at least
about 75%, and most preferably at least about 90 or 95%) of the nucleotides match 
over the defined length of the DNA sequences. Sequences that are substantially 
homologous can be identified by comparing the sequences using standard software 
available in sequence data banks, or in a Southern hybridization experiment under, 
for example, stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. See, e.g., 
Maniatis et al., supra; DNA Cloning, Vols. I & II, supra; Nucleic Acid
Hybridization, supra. 
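The percentage criterion for "substantially homologous" DNA sequences can be illustrated as follows. This is a minimal sketch: it assumes the two sequences have already been aligned over the same defined length, treats the "about" thresholds (50%, 75%, 90%) as hard cut-offs for simplicity, and uses hypothetical function names and sequences.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of matching positions between two equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)

def homology_tier(identity):
    """Map a percent identity onto the tiers named in the text."""
    if identity >= 90:
        return "most preferred"
    if identity >= 75:
        return "preferred"
    if identity >= 50:
        return "substantially homologous"
    return "below threshold"

identity = percent_identity("ACGTACGTAC", "ACGTACGTTT")
print(identity, homology_tier(identity))
```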

An "antisense nucleic acid" is a sequence of nucleotides that is 
complementary to the sense sequence. Antisense nucleic acids can be used to 
down regulate or block the expression of the polypeptide encoded by the sense 
strand. The present invention is also intended to cover corresponding anti-sense 
nucleic acids whose expression makes it possible to control the transcription of 
cellular mRNAs. Such nucleic acids may comprise all or part of a nucleic acid 
polynucleotide sequence according to the invention, transcribed in the reverse 
orientation. They are complementary or significantly complementary to the 
polynucleotide sequence of a nucleic acid according to the invention, either in total 
or in part. 
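The relationship between a sense sequence and its antisense counterpart can be sketched as a reverse complement operation. The example below is illustrative only; the input is a hypothetical six-nucleotide fragment and the function names are not part of the disclosure.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(dna):
    """Return the complementary strand written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]

def antisense_rna(sense_dna):
    """RNA obtained by transcribing the sense sequence in reverse orientation."""
    return reverse_complement(sense_dna).upper().replace("T", "U")

print(reverse_complement("ATGGCC"))
print(antisense_rna("ATGGCC"))
```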

A "signal sequence" is included at the beginning of the coding sequence of 
a protein to be expressed on the surface of a cell. This sequence encodes a signal 
peptide, N-terminal to the mature polypeptide, that directs the host cell to 
translocate the polypeptide. The term "translocation signal sequence" is used 
herein to refer to this sort of signal sequence. Translocation signal sequences can 
be found associated with a variety of proteins native to eukaryotes and prokaryotes, 
and are often functional in both types of organisms.

"Regulatory region" means a nucleic acid sequence which regulates the
expression of a second nucleic acid sequence. A regulatory region may include
sequences which are naturally responsible for expressing a particular nucleic acid
(a homologous region) or may include sequences of a different origin which are
responsible for expressing different proteins or even synthetic proteins (a
heterologous region). In particular, the sequences can be sequences of eukaryotic
or viral genes or derived sequences which stimulate or repress transcription of a
gene in a specific or non-specific manner and in an inducible or non-inducible
manner. Regulatory regions include origins of replication, RNA splice sites,
promoters, enhancers, transcriptional termination sequences, and signal sequences
which direct the polypeptide into the secretory pathways of the target cell.

A regulatory region from a "heterologous source" is a regulatory region 
which is not naturally associated with the expressed nucleic acid. Included among 
the heterologous regulatory regions are regulatory regions from a different species,
regulatory regions from a different gene, hybrid regulatory sequences, and 
regulatory sequences which do not occur in nature, but which are designed by one 
having ordinary skill in the art. 

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a 
chromosomal site of the cell. Preferably, the heterologous DNA includes a gene
foreign to the cell. 

"Homologous recombination" refers to the insertion of a foreign DNA 
sequence into another DNA molecule, e.g., insertion of a vector in a chromosome. 
Preferably, the vector targets a specific chromosomal site for homologous
recombination. For specific homologous recombination, the vector will contain
sufficiently long regions of homology to sequences of the chromosome to allow
complementary binding and incorporation of the vector into the chromosome. 
Longer regions of homology, and greater degrees of sequence similarity, may 
increase the efficiency of homologous recombination. 

A "polypeptide" is a polymeric compound comprised of covalently linked
amino acid residues. Amino acids have the following general structure:

        H
        |
    R - C - COOH
        |
        NH2

Amino acids are classified into seven groups on the basis of the side chain R: (1) 
aliphatic side chains, (2) side chains containing a hydroxylic (OH) group, (3) side 
chains containing sulfur atoms, (4) side chains containing an acidic or amide 
group, (5) side chains containing a basic group, (6) side chains containing an 
aromatic ring, and (7) proline, an imino acid in which the side chain is fused to the 
amino group. A polypeptide of the invention preferably comprises at least about 
14 amino acids. 

A "protein" is a polypeptide which plays a structural or functional role in a 
living cell. 

A "variant" of a polypeptide or protein is any analogue, fragment, 
derivative, or mutant which is derived from a polypeptide or protein and which 
retains at least one biological property of the polypeptide or protein. Different 
variants of the polypeptide or protein may exist in nature. These variants may be 
allelic variations characterized by differences in the nucleotide sequences of the 
structural gene coding for the protein, or may involve differential splicing or post- 
translational modification. The skilled artisan can produce variants having single 
or multiple amino acid substitutions, deletions, additions, or replacements. These 
variants may include, inter alia: (a) variants in which one or more amino acid 
residues are substituted with conservative or non-conservative amino acids, (b) 
variants in which one or more amino acids are added to the polypeptide or protein, 
(c) variants in which one or more of the amino acids includes a substituent group, 
and (d) variants in which the polypeptide or protein is fused with another 
polypeptide such as serum albumin. The techniques for obtaining these variants, 
including genetic (suppressions, deletions, mutations, etc.), chemical, and 
enzymatic techniques, are known to persons having ordinary skill in the art. A
variant of the invention preferably comprises at least about 14 amino acids.

If such allelic variations, analogues, fragments, derivatives, mutants, and 
modifications, including alternative mRNA splicing forms and alternative post-
translational modification forms result in derivatives of the polypeptide which
retain any of the biological properties of the polypeptide, they are intended to be
included within the scope of this invention. 

A "heterologous protein" refers to a protein not naturally produced in the
cell.

Two amino acid sequences are "substantially homologous" or "substantially
similar" when greater than about 40% of the amino acids are identical, or greater
than 60% are similar (functionally identical). Preferably, the similar or 
homologous sequences are identified by alignment using, for example, the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, 
Madison, Wisconsin) pileup program. 
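The 40% identity and 60% similarity criteria can be illustrated on a pre-aligned pair of sequences. This is only a sketch: the text relies on the GCG pileup program, which is not reproduced here; the grouping of "similar" residues below is an assumption made for illustration, gap columns are skipped, and the peptide strings are hypothetical.

```python
# Assumed groups of functionally similar residues (one-letter codes).
SIMILAR_GROUPS = [set("KRH"), set("DE"), set("ST"), set("NQ"),
                  set("AVLIM"), set("FWY")]

def identity_and_similarity(aln_a, aln_b):
    """Return (% identical, % similar) over the ungapped aligned columns."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for a, b in zip(aln_a.upper(), aln_b.upper()):
        if a == "-" or b == "-":
            continue  # skip alignment gaps
        compared += 1
        if a == b:
            identical += 1
            similar += 1
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared

def substantially_similar(aln_a, aln_b):
    """Apply the >40% identical or >60% similar criterion."""
    identity, similarity = identity_and_similarity(aln_a, aln_b)
    return identity > 40 or similarity > 60

print(identity_and_similarity("MKRLS-DE", "MRKLTADQ"))
```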

The term "corresponding to" is used herein to refer to similar or
homologous sequences, whether the exact position is identical or different from the
molecule to which the similarity or homology is measured. A nucleic acid or
amino acid sequence alignment may include spaces. Thus, the term
"corresponding to" refers to the sequence similarity, and not the numbering of the
amino acid residues or nucleotide bases.

A "derivative" is understood to mean, for the purposes of the present 
invention, the products comprising, for example, modifications at the level of the 
primary structure, such as deletions of one or more residues, substitutions of one or 
more residues, and/or modifications at the level of one or more residues. The
number of residues affected by the modifications may be, for example, from 1, 2 or
3 to 10, 20, or 30 residues. The term derivative also comprises the molecules
comprising additional internal or terminal parts, of a peptide nature or otherwise.
They may be in particular active parts, markers, amino acids, such as methionine at
position -1. The term derivative also comprises the molecules comprising
modifications at the level of the tertiary structure (N-terminal end, and the like).
The term derivative also comprises sequences homologous to the sequence
considered, derived from other cellular sources, and in particular from cells of 
human origin, or from other organisms, and possessing activity of the same type or 
of substantially similar type. Such homologous sequences may be obtained by 
hybridization experiments. The hybridizations may be performed based on nucleic
acid libraries, using, as probe, the native sequence or a fragment thereof, under
conventional stringency conditions (Maniatis et al., General Molecular Biology
Techniques), or preferably under high stringency conditions. 

These derivatives may be generated for various purposes, such as in
particular that of increasing the affinity of the peptide for its site(s) of interaction,
that of enhancing its production levels, or that of conferring new pharmacokinetic 
and/or biological properties on it. 

Preferably, the derivatives share at least one structural homology with a
polypeptide according to the invention and more preferably at least at the level of
the regions bordering the bHLH domain, and conserve its biological property of
transcriptional activator. The derivatives may also offer new properties to the 
claimed polypeptides (labelling, and the like), or enhance their properties (stability, 
biological activity and the like). 

In accordance with the present invention there may be employed
conventional molecular biology, microbiology, and recombinant DNA techniques
within the skill of the art. Such techniques are explained fully in the literature.
See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory
Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, New York (herein "Sambrook et al., 1989"); DNA Cloning: A Practical
Approach, Volumes I and II (D.M. Glover ed. 1985); Oligonucleotide Synthesis
(M.J. Gait ed. 1984); Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds.
(1985)]; Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1984)];
Animal Cell Culture [R.I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes
[IRL Press, (1986)]; B. Perbal, A Practical Guide To Molecular Cloning (1984);
F.M. Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley &
Sons, Inc. (1994).

A gene encoding a polypeptide according to the invention, whether 
genomic DNA or cDNA, can be isolated from any source, particularly from a 
human cDNA or genomic library. General methods for obtaining such a gene are 
well known in the art, as described above (see, e.g., Sambrook et al., 1989, supra).

Accordingly, any animal cell potentially can serve as the nucleic acid 
source for the molecular cloning of a gene according to the invention. The DNA 
may be obtained by standard procedures known in the art from cloned DNA (e.g., a 
DNA "library"), and preferably is obtained from a cDNA library prepared from 
tissues with high level expression of the protein (e.g., heart, pancreas and skeletal
muscle cDNA, since these are the cells that evidence high levels of expression of a
polypeptide according to the invention), by chemical synthesis, by cDNA cloning,
or by the cloning of genomic DNA, or fragments thereof, purified from the desired
cell (see, for example, Sambrook et al., 1989, supra; Glover, D.M. (ed.), 1985,
DNA Cloning: A Practical Approach, IRL Press, Ltd., Oxford, U.K. Vol. I, II).
Clones derived from genomic DNA may contain regulatory and intron DNA 
regions in addition to coding regions; clones derived from cDNA will not contain 
intron sequences. Whatever the source, the gene should be molecularly cloned into 
a suitable vector for propagation of the gene. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene of the invention may be accomplished in a 
number of ways. For example, DNA fragments may be screened by nucleic acid 
hybridization to a labeled probe (Benton and Davis, 1977, Science 196:180; 
Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). Those DNA
fragments with substantial homology to the probe will hybridize. As noted above,
the greater the degree of homology, the more stringent hybridization conditions can
be used. In a specific embodiment, Northern hybridization conditions are used to
identify mRNA splicing variants of a gene according to the invention. 

Further selection can be carried out on the basis of the properties of the 
gene, e.g., if the gene encodes a protein product having the isoelectric,
electrophoretic, amino acid composition, or partial amino acid sequence of a
polypeptide according to the invention as disclosed herein. Thus, the presence of
the gene may be detected by assays based on the physical, chemical, or
immunological properties of its expressed product. For example, cDNA clones, or
DNA clones which hybrid-select the proper mRNAs, can be selected which
produce a protein that, e.g., has similar or identical electrophoretic migration,
isoelectric focusing or non-equilibrium pH gel electrophoresis behavior, proteolytic
digestion maps, or antigenic properties as known for a polypeptide according to the
invention. In a specific embodiment, the expressed protein is recognized by a
polyclonal antibody that is generated against an epitope specific for human ngn3
peptide. 

The present invention also relates to genes (e.g., cDNAs) encoding allelic 
variants, splicing variants, analogs, and derivatives of a polypeptide according to 
the invention, that have the same or homologous functional activity as a
polypeptide according to the invention, and homologs thereof from other species.
The production and use of derivatives and analogs related to polypeptides of the
invention are within the scope of the present invention. In a specific embodiment,
the derivative or analog is functionally active, i.e., capable of exhibiting one or
more functional activities associated with a full-length, wild-type Relax or hngn3
polypeptide of the invention.

A derivative of a polypeptide according to the invention can be made by 
altering encoding nucleic acid sequences by substitutions, additions or deletions 
that provide for functionally equivalent molecules. Preferably, derivatives are 
made that have enhanced or increased functional activity relative to native Relax or
hngn3 polypeptide.

Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences which encode substantially the same amino acid sequence as a Relax 
gene or an hngn3 gene, including an amino acid sequence that contains a single 
amino acid variant, may be used in the practice of the present invention. These
include but are not limited to allelic genes, homologous genes from other species,
and nucleotide sequences comprising all or portions of a gene according to the
invention that is altered by the substitution of different codons that encode the 
same amino acid residue within the sequence, thus producing a silent change. 
Likewise, the derivatives of the invention include, but are not limited to, those 
containing, as a primary amino acid sequence, all or part of the amino acid
sequence of a Relax polypeptide or an hngn3 polypeptide including altered 
sequences in which functionally equivalent amino acid residues are substituted for 
residues within the sequence resulting in a conservative amino acid substitution. 
For example, one or more amino acid residues within the sequence can be
substituted by another amino acid of a similar polarity, which acts as a functional
equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine.
Amino acids containing aromatic ring structures are phenylalanine, tryptophan, and
tyrosine. The polar neutral amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids 
include arginine, lysine and histidine. The negatively charged (acidic) amino acids 
include aspartic acid and glutamic acid. Such alterations will not be expected to
affect apparent molecular weight as determined by polyacrylamide gel
electrophoresis, or isoelectric point. 

Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and

- Gln for Asn such that a free CONH2 can be maintained.
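The particularly preferred substitutions listed above can be encoded as a simple lookup. The sketch below is illustrative only: it records each substitution in the direction stated in the list (with "vice versa" adding the reverse direction), and the function name is hypothetical.

```python
# (original residue, replacement residue) pairs taken from the list above.
PREFERRED_SUBSTITUTIONS = {
    ("Arg", "Lys"), ("Lys", "Arg"),  # Lys for Arg and vice versa
    ("Asp", "Glu"), ("Glu", "Asp"),  # Glu for Asp and vice versa
    ("Thr", "Ser"),                  # Ser for Thr
    ("Asn", "Gln"),                  # Gln for Asn
}

def is_preferred_substitution(original, replacement):
    """True if replacing `original` by `replacement` is in the list."""
    return (original, replacement) in PREFERRED_SUBSTITUTIONS

print(is_preferred_substitution("Arg", "Lys"))
print(is_preferred_substitution("Lys", "Glu"))
```

Whether the reverse of the last two substitutions (Thr for Ser, Asn for Gln) is equally preferred is not stated in the list, so the sketch leaves them out.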

Amino acid substitutions may also be introduced to substitute an amino 
acid with a particularly preferable property. For example, a Cys may be introduced
as a potential site for disulfide bridges with another Cys. A His may be introduced as
a particularly "catalytic" site (i.e., His can act as an acid or base and is the most
common amino acid in biochemical catalysis). Pro may be introduced because of
its particularly planar structure, which induces β-turns in the protein's structure.

The genes encoding derivatives and analogs of the invention can be 
produced by various methods known in the art. The manipulations that result in 
their production can occur at the gene or protein level. For example, a cloned gene 
sequence according to the invention can be modified by any of numerous strategies 
known in the art (Sambrook et al., 1989, supra). The sequence can be cleaved at 
appropriate sites with restriction endonuclease(s), followed by further enzymatic 
modification if desired, isolated, and ligated in vitro. In the production of the gene
encoding a derivative or analog of a polypeptide according to the invention, care
should be taken to ensure that the modified gene remains within the same
translational reading frame as the wild-type gene, uninterrupted by translational
stop signals, in the gene region where the desired activity is encoded. 
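The two precautions named above (same reading frame, no interrupting stop signal) can be checked with a short script. This is a simplified sketch under stated assumptions: the frame check only compares overall lengths, so it would miss compensating insertions and deletions that shift the frame locally; the function names and sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def keeps_reading_frame(wild_type, modified):
    """True if the net length change is a multiple of three nucleotides."""
    return (len(modified) - len(wild_type)) % 3 == 0

def has_premature_stop(coding_dna):
    """True if a stop codon occurs in frame before the final codon."""
    codons = [coding_dna[i:i + 3].upper()
              for i in range(0, len(coding_dna) - 2, 3)]
    return any(codon in STOP_CODONS for codon in codons[:-1])

wild_type = "ATGGCTAAATAA"        # hypothetical: Met-Ala-Lys-stop
in_frame = "ATGGCTGGAAAATAA"      # three nucleotides (GGA) inserted
frameshift = "ATGGCTTAAATAA"      # one nucleotide (T) inserted

print(keeps_reading_frame(wild_type, in_frame), has_premature_stop(in_frame))
print(keeps_reading_frame(wild_type, frameshift), has_premature_stop(frameshift))
```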

Additionally, the nucleic acid sequence encoding a polypeptide according 
to the invention can be mutated in vitro or in vivo, to create and/or destroy 
translation, initiation, and/or termination sequences, or to create variations in 
coding regions and/or form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro modification. Preferably, such 
mutations enhance the functional activity of the mutated gene product. Any 
technique for mutagenesis known in the art can be used, including but not limited 
to, in vitro site-directed mutagenesis (Hutchinson, C., et al., 1978, J. Biol. Chem.
253:6551; Zoller and Smith, 1984, DNA 3:479-488; Oliphant et al., 1986, Gene
44:177; Hutchinson et al., 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710), use of
TAB® linkers (Pharmacia), etc. PCR techniques are preferred for site directed
mutagenesis (see Higuchi, 1989, "Using PCR to Engineer DNA", in PCR
Technology: Principles and Applications for DNA Amplification, H. Erlich, ed.,
Stockton Press, Chapter 6, pp. 61-70).

The identified and isolated gene can then be inserted into an appropriate 
cloning vector. A large number of vector-host systems known in the art may be 
used. Possible vectors include, but are not limited to, plasmids or modified
viruses, but the vector system must be compatible with the host cell used.
Examples of vectors include, but are not limited to, Escherichia coli,
bacteriophages such as lambda derivatives, or plasmids such as pBR322
derivatives or pUC plasmid derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc.
The insertion into a cloning vector can, for example, be accomplished by ligating
the DNA fragment into a cloning vector that has complementary cohesive termini.
However, if the complementary restriction sites used to fragment the DNA are not
present in the cloning vector, the ends of the DNA molecules may be enzymatically
modified. Alternatively, any site desired may be produced by ligating nucleotide
sequences (linkers) onto the DNA termini; these ligated linkers may comprise
specific chemically synthesized oligonucleotides encoding restriction endonuclease
recognition sequences. Recombinant molecules can be introduced into host cells
via transformation, transfection, infection, electroporation, etc., so that many
copies of the gene sequence are generated. Preferably, the cloned gene is contained
on a shuttle vector plasmid, which provides for expansion in a cloning cell, e.g., E.
coli, and purification for subsequent insertion into an appropriate expression cell
line, if such is desired. For example, a shuttle vector, which is a vector that can
replicate in more than one type of organism, can be prepared for replication in both
E. coli and Saccharomyces cerevisiae by linking sequences from an E. coli plasmid
with sequences from the yeast 2µ plasmid.

Nucleic Acids Encoding Relax and hngn3 Polypeptides 
The present invention relates to nucleic acids encoding novel basic helix-
loop-helix polypeptides, Relax and hngn3. Thus, a first subject of the invention is
a nucleic acid comprising a polynucleotide sequence of a) SEQ ID NO: 1, or of a
complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a
complementary polynucleotide sequence.

The invention also relates to a nucleic acid comprising at least 8
consecutive nucleotides of a polynucleotide sequence of a) SEQ ID NO: 1, or of
a complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a
complementary polynucleotide sequence.

Preferably, a nucleic acid according to the invention comprises at least
10, 12, 15, 18, 20 to 25, 35, 40, 50, 70, 80, 100, 200, or 500 consecutive
nucleotides of a polynucleotide sequence of a) SEQ ID NO: 1, or of a
complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a
complementary polynucleotide sequence.
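The "at least 8 consecutive nucleotides" criterion can be illustrated by a substring search against a reference sequence and its complementary strand. This is a minimal sketch: the reference below is a hypothetical 18-nucleotide stand-in, not SEQ ID NO: 1 or SEQ ID NO: 9, the complementary sequence is read as the reverse complement, and the function names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna):
    """Return the complementary strand written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]

def shares_consecutive_run(candidate, reference, k=8):
    """True if `candidate` contains at least k consecutive nucleotides
    of `reference` or of its complementary strand."""
    candidate = candidate.upper()
    for target in (reference.upper(), reverse_complement(reference)):
        for i in range(len(target) - k + 1):
            if target[i:i + k] in candidate:
                return True
    return False

reference = "ATGGCTAAATTTGGGCCC"  # hypothetical stand-in sequence
print(shares_consecutive_run("TTGCTAAATTTCC", reference))
print(shares_consecutive_run("ACGTACGT", reference))
```

Raising `k` to 10, 12, 15 and so on corresponds to the preferred minimum lengths listed in the text.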

The invention also relates to a nucleic acid having at least 80% nucleotide
identity with a nucleic acid comprising a polynucleotide sequence of a)
SEQ ID NO: 1, or of a complementary polynucleotide sequence, or b) SEQ ID NO:
9, or of a complementary polynucleotide sequence.

The invention also relates to a nucleic acid having at least 85%, preferably
90%, more preferably 95% and still more preferably 98% nucleotide identity with a
nucleic acid comprising a polynucleotide sequence of a) SEQ ID NO: 1, or of a
complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a
complementary polynucleotide sequence.
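
As a minimal sketch, the percentage identity thresholds above can be checked on
two sequences that have already been aligned (gaps written as "-"). The
convention used here, identical columns divided by the number of aligned columns,
is an assumption made for illustration; other conventions exist and the
specification does not prescribe one at this point. The function names are
hypothetical.

```python
# Illustrative sketch; assumes the two sequences were aligned beforehand.
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns, ignoring columns where both
    sequences carry a gap character ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 80.0) -> bool:
    """Return True if the percent identity is at least `threshold`."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

The same functions apply unchanged to aligned amino acid sequences, since only
character equality is tested.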

The invention also relates to a nucleic acid hybridizing, under high
stringency conditions, with a polynucleotide sequence of a) SEQ ID NO: 1, or of
a complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a
complementary polynucleotide sequence.

The invention also relates to nucleic acids, particularly cDNA molecules,
which encode the full length rat Relax or human ngn3 proteins. The present
invention also relates to a cDNA molecule that encodes the novel full length rat
Relax or human ngn3 proteins. Thus, the invention relates to a nucleic acid
comprising a polynucleotide sequence of a) SEQ ID NO: 1, or of a complementary
polynucleotide sequence, or b) SEQ ID NO: 9, or of a complementary
polynucleotide sequence.

The invention also relates to a nucleic acid comprising a polynucleotide
sequence as depicted in a) SEQ ID NO: 1, or of a complementary polynucleotide
sequence, or b) SEQ ID NO: 9, or of a complementary polynucleotide sequence.

According to the invention, a nucleic acid comprising a polynucleotide
sequence of SEQ ID NO: 1 encodes a full length rat Relax polypeptide of 214
amino acids comprising the amino acid sequence of SEQ ID NO: 8.

According to the invention, a nucleic acid comprising a polynucleotide
sequence of SEQ ID NO: 9 encodes a full length human hngn3 polypeptide of 214
amino acids comprising the amino acid sequence of SEQ ID NO: 10.

Relax and hngn3 Polypeptides According to the Invention
The present invention also relates to rat Relax or human ngn3 (hngn3)
polypeptides according to the invention.

Thus, the present invention relates to a nucleic acid that encodes a
polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 or
10.

The invention also relates to a polypeptide comprising an amino acid
sequence as depicted in either one of SEQ ID NOs: 8 or 10.

The invention also relates to a polypeptide comprising an amino acid
sequence having at least 80% amino acid identity with a polypeptide comprising
an amino acid sequence of either one of SEQ ID NOs: 8 or 10.

The invention also relates to a polypeptide having at least 85%, preferably
90%, more preferably 95% and still more preferably 98% amino acid identity with
a polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8
or 10.

Preferably, a polypeptide according to the invention will have a length of
15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100 or 200 consecutive amino acids of a
polypeptide according to the invention, in particular a polypeptide comprising an
amino acid sequence of either one of SEQ ID NOs: 8 or 10.

Alternatively, a polypeptide according to the invention will comprise a
fragment having a length of 15, 18, 20, 25, 35, 40, 50, 100 or 200 consecutive
amino acids of a polypeptide according to the invention, more particularly of a
polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8
or 10.

Expression of Polypeptides of the Invention and Methods of Producing
Polypeptides of the Invention
The invention relates to a method for the production of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NOs: 8 or 10, or of 
a polypeptide or a variant thereof, wherein said method comprises the steps of: 

a) inserting a nucleic acid encoding said polypeptide into an appropriate
vector;

b) culturing, in an appropriate culture medium, a previously transformed 
host cell or transfecting a host cell with the recombinant vector of step a); 

c) recovering the conditioned culture medium or lysing the host cell, for 
example by sonication or by osmotic shock; 

d) separating and purifying said polypeptide from said culture medium or 
alternatively from the cell lysates obtained in step c); and 

e) where appropriate, characterizing the recombinant polypeptide produced.

A specific embodiment of the invention relates to a method for producing a
polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 or
10.

A polypeptide termed "homologous" to a polypeptide having an amino acid 
sequence comprising either one of SEQ ID NOs: 8 or 10 also forms part of the 
invention. Such a homologous polypeptide comprises an amino acid sequence 
possessing one or more substitutions of an amino acid by an equivalent amino acid, 
relative to either one of SEQ ID NOs: 8 or 10, respectively. 

The polypeptides according to the invention may be characterized by 
binding to an immunoaffinity chromatography column on which the antibodies 
directed against this polypeptide or against a fragment or a variant thereof have 
been previously immobilized. 

According to another aspect, a recombinant polypeptide according to the 
invention may be purified by passing it over an appropriate series of 
chromatography columns, according to methods known to persons skilled in the art 
and described for example in Ausubel et al. (1989). 

A polypeptide according to the invention may also be prepared by
conventional chemical synthesis techniques either in homogeneous solution or in
solid phase. By way of illustration, a polypeptide according to the invention may
be prepared either by the homogeneous solution technique described by Houben
Weyl, 1974 (In: Methode der Organischen Chemie, E. Wunsch Ed., 15-I:15-II) or by
the solid phase synthesis technique described by Merrifield, 1965 (Nature,
207(996):522-523 and Science, 150(693):178-185).

An "equivalent amino acid" according to the present invention will be
understood to mean for example replacement of a residue in the L form by a
residue in the D form or the replacement of a glutamic acid (E) by a pyro-glutamic
acid according to techniques well known to persons skilled in the art. By way of
illustration, the synthesis of a peptide containing at least one residue in the D form is
described by Koch, 1977 (Biochem. Biophys. Res. Commun., 74:488-491).
According to another aspect, two amino acids belonging to the same class, that is
to say two uncharged polar, nonpolar, basic or acidic amino acids, are also
considered as equivalent amino acids.
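
The class-based notion of equivalence can be illustrated with a short Python
sketch. The assignment of the twenty standard residues to the four classes shown
here is one commonly used grouping and is an assumption made for illustration;
the specification names the classes but does not list their members. The
identifiers are hypothetical.

```python
# Illustrative sketch; the class membership below is an assumed, commonly used grouping.
AMINO_ACID_CLASSES = {
    "nonpolar": set("AVLIPFWM"),
    "uncharged polar": set("GSTCYNQ"),
    "basic": set("KRH"),
    "acidic": set("DE"),
}


def residue_class(residue: str) -> str:
    """Return the class name of a one-letter amino acid code."""
    for name, members in AMINO_ACID_CLASSES.items():
        if residue.upper() in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_equivalent_substitution(original: str, replacement: str) -> bool:
    """Return True if both residues belong to the same class."""
    return residue_class(original) == residue_class(replacement)
```

Under this grouping, replacing lysine (K) by arginine (R) counts as a
substitution by an equivalent amino acid, whereas replacing leucine (L) by
aspartic acid (D) does not.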

Polypeptides comprising at least one nonpeptide bond such as a retro- 
inverse bond (NHCO), a carba bond (CH2CH2) or a ketomethylene bond (CO-CH2) 
also form part of the invention. 

Preferably, the polypeptides according to the invention comprising one or
more additions, deletions, substitutions of at least one amino acid will retain their
capacity to be recognized by antibodies directed against the nonmodified
polypeptides.

The nucleotide sequence coding for a polypeptide according to the
invention, or antigenic fragment, derivative or analog thereof, or a functionally
active derivative, including a chimeric protein, thereof, can be inserted into an
appropriate expression vector, i.e., a vector that contains the necessary elements for
the transcription and translation of the inserted protein-coding sequence. Such
elements are termed herein a "promoter." Thus, the nucleic acid of the invention is
operationally associated with a promoter in an expression vector of the invention.
Both cDNA and genomic sequences can be cloned and expressed under control of
such regulatory sequences. An expression vector also preferably includes a
replication origin.

The necessary transcriptional and translational signals can be provided on a
recombinant expression vector, or they may be supplied by the native gene
encoding a polypeptide according to the invention and/or its flanking regions.
Potential host-vector systems include but are not limited to mammalian cell
systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell
systems infected with virus (e.g., baculovirus); microorganisms such as yeast
containing yeast vectors; or bacteria transformed with bacteriophage DNA,
plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their
strengths and specificities. Depending on the host-vector system utilized, any one
of a number of suitable transcription and translation elements may be used.

A recombinant polypeptide of the invention, or functional fragment,
derivative, chimeric construct, or analog thereof, may be expressed
chromosomally, after integration of the coding sequence by recombination. In this
regard, any of a number of amplification systems may be used to achieve high
levels of stable gene expression (See Sambrook et al., 1989, supra).

The cell into which the recombinant vector comprising the nucleic acid
encoding a polypeptide according to the invention has been introduced is cultured
in an appropriate cell culture medium under conditions that provide for expression
of the polypeptide by the cell. Any of the methods previously described for the
insertion of DNA fragments into a cloning vector may be used to construct
expression vectors containing a gene consisting of appropriate
transcriptional/translational control signals and the protein coding sequences.
These methods may include in vitro recombinant DNA and synthetic techniques and
in vivo recombination (genetic recombination).

A nucleic acid encoding a polypeptide according to the invention may be
operably linked and controlled by any regulatory region, i.e., promoter/enhancer
element known in the art, but these regulatory elements must be functional in the
host target tumor selected for expression. The regulatory regions may comprise a
promoter region for functional transcription in the host cell, as well as a region
situated 3' of the gene of interest, and which specifies a signal for termination of
transcription and a polyadenylation site. All these elements constitute an
expression cassette.

Promoters that may be used in the present invention include both
constitutive promoters and regulated (inducible) promoters. The promoter may be
naturally responsible for the expression of the nucleic acid. It may also be from a
heterologous source. In particular, it may be promoter sequences of eukaryotic or
viral genes. For example, it may be promoter sequences derived from the genome
of the cell that it is desired to infect. Likewise, it may be promoter sequences
derived from the genome of a virus, such as adenovirus (E1A and MLP),
cytomegalovirus (CMV), or Rous Sarcoma Virus (RSV). In addition, the promoter
may be modified by addition of activating or regulatory sequences or sequences
allowing a tissue-specific or predominant expression (enolase and GFAP promoters
and the like). Moreover, when the nucleic acid does not contain promoter
sequences, such sequences may be inserted.

Some promoters useful for practice of this invention are ubiquitous
promoters (e.g., HPRT, vimentin, actin, tubulin), intermediate filament promoters
(e.g., desmin, neurofilaments, keratin, GFAP), therapeutic gene promoters (e.g.,
MDR type, CFTR, factor VIII), tissue-specific promoters (e.g., actin promoter in
smooth muscle cells), promoters which are preferentially activated in dividing
cells, promoters which respond to a stimulus (e.g., steroid hormone receptor,
retinoic acid receptor), tetracycline-regulated transcriptional modulators,
cytomegalovirus (CMV) immediate-early, retroviral LTR, metallothionein, SV-40,
adenovirus E1a, and adenovirus major late (MLP) promoters. Tetracycline-regulated
transcriptional modulators and CMV promoters are described in WO
96/01313, US 5,168,062 and 5,385,839, the contents of which are incorporated
herein by reference.

More specifically, expression of a polypeptide according to the invention
may be controlled by any promoter/enhancer element known in the art, but these
regulatory elements must be functional in the host selected for expression.
Promoters which may be used to control gene expression include, but are not
limited to, the SV40 early promoter region (Benoist and Chambon, 1981, Nature
290:304-310), the promoter contained in the 3' long terminal repeat of Rous
sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine
kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445),
the regulatory sequences of the metallothionein gene (Brinster et al., 1982,
Nature 296:39-42); prokaryotic expression vectors such as the β-lactamase
promoter (Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731),
or the tac promoter (DeBoer, et al., 1983, Proc. Natl. Acad. Sci. U.S.A.
80:21-25); see also "Useful proteins from recombinant bacteria" in Scientific
American, 1980, 242:74-94; promoter elements from yeast or other fungi such as
the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK
(phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and the animal
transcriptional control regions, which exhibit tissue specificity and have been
utilized in transgenic animals: elastase I gene control region which is active in
pancreatic acinar cells (Swift et al., 1984, Cell 38:639-646; Ornitz et al., 1986,
Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987,
Hepatology 7:425-515); insulin gene control region which is active in pancreatic
beta cells (Hanahan, 1985, Nature 315:115-122), immunoglobulin gene control
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658;
Adames et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol.
7:1436-1444), mouse mammary tumor virus control region which is active in
testicular, breast, lymphoid and mast cells (Leder et al., 1986, Cell 45:485-495),
albumin gene control region which is active in liver (Pinkert et al., 1987, Genes
and Devel. 1:268-276), alpha-fetoprotein gene control region which is active in
liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al., 1987,
Science 235:53-58), alpha 1-antitrypsin gene control region which is active in the
liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene control
region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340;
Kollias et al., 1986, Cell 46:89-94), myelin basic protein gene control region which
is active in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell
48:703-712), myosin light chain-2 gene control region which is active in skeletal
muscle (Sani, 1985, Nature 314:283-286), and gonadotropic releasing hormone gene
control region which is active in the hypothalamus (Mason et al., 1986, Science
234:1372-1378).

Expression vectors containing a nucleic acid encoding a polypeptide of the
invention can be identified by five general approaches: (a) PCR amplification of
the desired plasmid DNA or specific mRNA, (b) nucleic acid hybridization, (c)
presence or absence of selection marker gene functions, (d) analyses with
appropriate restriction endonucleases, and (e) expression of inserted sequences. In
the first approach, the nucleic acids can be amplified by PCR to provide for
detection of the amplified product. In the second approach, the presence of a
foreign gene inserted in an expression vector can be detected by nucleic acid
hybridization using probes comprising sequences that are homologous to an
inserted marker gene. In the third approach, the recombinant vector/host system
can be identified and selected based upon the presence or absence of certain
"selection marker" gene functions (e.g., β-galactosidase activity, thymidine kinase
activity, resistance to antibiotics, transformation phenotype, occlusion body
formation in baculovirus, etc.) caused by the insertion of foreign genes in the
vector. In another example, if the nucleic acid encoding a polypeptide according to
the invention is inserted within the "selection marker" gene sequence of the vector,
recombinants containing the nucleic acid insert can be identified by the absence of
the gene function. In the fourth approach, recombinant expression vectors are
identified by digestion with appropriate restriction enzymes. In the fifth approach,
recombinant expression vectors can be identified by assaying for the activity,
biochemical, or immunological characteristics of the gene product expressed by the
recombinant, provided that the expressed protein assumes a functionally active
conformation.

A wide variety of host/expression vector combinations may be employed in
expressing the DNA sequences of this invention. Useful expression vectors, for
example, may consist of segments of chromosomal, non-chromosomal and
synthetic DNA sequences. Suitable vectors include derivatives of SV40 and
known bacterial plasmids, e.g., E. coli plasmids col E1, pCR1, pBR322, pMal-C2,
pET, pGEX (Smith et al., 1988, Gene 67:31-40), pMB9 and their derivatives,
plasmids such as RP4; phage DNAs, e.g., the numerous derivatives of phage λ,
e.g., NM989, and other phage DNA, e.g., M13 and filamentous single stranded
phage DNA; yeast plasmids such as the 2μ plasmid or derivatives thereof; vectors
useful in eukaryotic cells, such as vectors useful in insect or mammalian cells;
vectors derived from combinations of plasmids and phage DNAs, such as plasmids
that have been modified to employ phage DNA or other expression control
sequences; and the like.

For example, in a baculovirus expression system, both non-fusion transfer
vectors, such as but not limited to pVL941 (BamHI cloning site; Summers),
pVL1393 (BamHI, SmaI, XbaI, EcoRI, NotI, XmaIII, BglII, and PstI cloning site;
Invitrogen), pVL1392 (BglII, PstI, NotI, XmaIII, EcoRI, XbaI, SmaI, and BamHI
cloning site; Summers and Invitrogen), and pBlueBacIII (BamHI, BglII, PstI, NcoI,
and HindIII cloning site, with blue/white recombinant screening possible;
Invitrogen), and fusion transfer vectors, such as but not limited to pAc700 (BamHI
and KpnI cloning site, in which the BamHI recognition site begins with the
initiation codon; Summers), pAc701 and pAc702 (same as pAc700, with different
reading frames), pAc360 (BamHI cloning site 36 base pairs downstream of a
polyhedrin initiation codon; Invitrogen (195)), and pBlueBacHisA, B, C [three
different reading frames, with BamHI, BglII, PstI, NcoI, and HindIII cloning site,
an N-terminal peptide for ProBond purification, and blue/white recombinant
screening of plaques; Invitrogen (220)] can be used.

Mammalian expression vectors contemplated for use in the invention
include vectors with inducible promoters, such as the dihydrofolate reductase
(DHFR) promoter, e.g., any expression vector with a DHFR expression vector, or a
DHFR/methotrexate co-amplification vector, such as pED (PstI, SalI, SbaI, SmaI,
and EcoRI cloning site, with the vector expressing both the cloned gene and
DHFR; see Kaufman, Current Protocols in Molecular Biology, 16.12 (1991)).
Alternatively, a glutamine synthetase/methionine sulfoximine co-amplification
vector can be used, such as pEE14 (HindIII, XbaI, SmaI, SbaI, EcoRI, and BclI
cloning site, in which the vector expresses glutamine synthase and the cloned gene;
Celltech). In another embodiment, a vector that directs episomal expression under
control of Epstein Barr Virus (EBV) can be used, such as pREP4 (BamHI, SfiI, XhoI,
NotI, NheI, HindIII, NheI, PvuII, and KpnI cloning site, constitutive Rous Sarcoma
Virus Long Terminal Repeat (RSV-LTR) promoter, hygromycin selectable marker;
Invitrogen), pCEP4 (BamHI, SfiI, XhoI, NotI, NheI, HindIII, NheI, PvuII, and KpnI
cloning site, constitutive human cytomegalovirus (hCMV) immediate early gene,
hygromycin selectable marker; Invitrogen), pMEP4 (KpnI, PvuI, NheI, HindIII,
NotI, XhoI, SfiI, BamHI cloning site, inducible metallothionein IIa gene promoter,
hygromycin selectable marker; Invitrogen), pREP8 (BamHI, XhoI, NotI, HindIII,
NheI, and KpnI cloning site, RSV-LTR promoter, histidinol selectable marker;
Invitrogen), pREP9 (KpnI, NheI, HindIII, NotI, XhoI, SfiI, and BamHI cloning site,
RSV-LTR promoter, G418 selectable marker; Invitrogen), and pEBVHis (RSV-LTR
promoter, hygromycin selectable marker, N-terminal peptide purifiable via
ProBond resin and cleaved by enterokinase; Invitrogen). Selectable mammalian
expression vectors for use in the invention include pRc/CMV (HindIII, BstXI, NotI,
SbaI, and ApaI cloning site, G418 selection; Invitrogen), pRc/RSV (HindIII, SpeI,
BstXI, NotI, XbaI cloning site, G418 selection; Invitrogen), and others. Vaccinia
virus mammalian expression vectors (see, Kaufman, 1991, supra) for use
according to the invention include but are not limited to pSC11 (SmaI cloning site,
TK- and β-gal selection), pMJ601 (SalI, SmaI, AflI, NarI, BspMII, BamHI, ApaI,
NheI, SacII, KpnI, and HindIII cloning site; TK- and β-gal selection), and
pTKgptF1S (EcoRI, PstI, SalI, AccI, HindII, SbaI, BamHI, and Hpa cloning site,
TK or XPRT selection).

Yeast expression systems can also be used according to the invention to
express a polypeptide according to the invention. For example, the non-fusion
pYES2 vector (XbaI, SphI, ShoI, NotI, GstXI, EcoRI, BstXI, BamHI, SacI, KpnI,
and HindIII cloning site; Invitrogen) or the fusion pYESHisA, B, C (XbaI, SphI,
ShoI, NotI, BstXI, EcoRI, BamHI, SacI, KpnI, and HindIII cloning site, N-terminal
peptide purified with ProBond resin and cleaved with enterokinase; Invitrogen), to
mention just two, can be employed according to the invention.

Once a particular recombinant DNA molecule is identified and isolated,
several methods known in the art may be used to propagate it. Once a suitable host
system and growth conditions are established, recombinant expression vectors can
be propagated and prepared in quantity. As previously explained, the expression
vectors which can be used include, but are not limited to, the following vectors or
their derivatives: human or animal viruses such as vaccinia virus or adenovirus;
insect viruses such as baculovirus; yeast vectors; bacteriophage vectors (e.g.,
lambda), and plasmid and cosmid DNA vectors, to name but a few.

In addition, a host cell strain may be chosen which modulates the
expression of the inserted sequences, or modifies and processes the gene product in
the specific fashion desired. Different host cells have characteristic and specific
mechanisms for the translational and post-translational processing and
modification of proteins. Appropriate cell lines or host systems can be chosen to
ensure the desired modification and processing of the foreign protein expressed.
Expression in yeast can produce a biologically active product. Expression in
eukaryotic cells can increase the likelihood of "native" folding. Moreover,
expression in mammalian cells can provide a tool for reconstituting, or
constituting, activity of a polypeptide according to the invention. Furthermore,
different vector/host expression systems may affect processing reactions, such as
proteolytic cleavages, to a different extent.

Vectors are introduced into the desired host cells by methods known in the
art, e.g., transfection, electroporation, microinjection, transduction, cell fusion,
DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use
of a gene gun, or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol.
Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut
et al., Canadian Patent Application No. 2,012,311, filed March 15, 1990).

Soluble forms of the protein can be obtained by collecting culture fluid, or
solubilizing inclusion bodies, e.g., by treatment with detergent, and if desired
sonication or other mechanical processes, as described above. The solubilized or
soluble protein can be isolated using various techniques, such as polyacrylamide
gel electrophoresis (PAGE), isoelectric focusing, 2-dimensional gel
electrophoresis, chromatography (e.g., ion exchange, affinity, immunoaffinity, and
sizing column chromatography), centrifugation, differential solubility,
immunoprecipitation, or by any other standard technique for the purification of
proteins.




Antibodies to Polypeptides of the Invention
A polypeptide according to the invention produced recombinantly or by
chemical synthesis, and fragments or other derivatives or analogs thereof, including
fusion proteins, may be used as an antigen or immunogen to generate antibodies.
Preferably, the antibodies specifically bind a Relax or ngn3 polypeptide according
to the invention. More preferably, the antibodies specifically bind human ngn3.



A molecule is "antigenic" when it is capable of specifically interacting with
an antigen recognition molecule of the immune system, such as an
immunoglobulin (antibody) or T cell antigen receptor. An antigenic polypeptide
contains at least about 5, and preferably at least about 10, amino acids. An
antigenic portion of a molecule can be that portion that is immunodominant for
antibody or T cell receptor recognition, or it can be a portion used to generate an
antibody to the molecule by conjugating the antigenic portion to a carrier molecule
for immunization. A molecule that is antigenic need not be itself immunogenic,
i.e., capable of eliciting an immune response without a carrier.

Such antibodies include but are not limited to polyclonal, monoclonal,
chimeric, single chain, Fab fragments, and an Fab expression library. The
antibodies of the invention may be cross reactive, e.g., they may recognize Relax or
ngn3 from different species. Polyclonal antibodies have greater likelihood of cross
reactivity. Alternatively, an antibody of the invention may be specific for a single
form of ngn3, such as human ngn3. Preferably, such an antibody is specific for
human ngn3.

Various procedures known in the art may be used for the production of
polyclonal antibodies. For the production of antibody, various host animals can be
immunized by injection with a polypeptide according to the invention, or a
derivative (e.g., fragment or fusion protein) thereof, including but not limited to
rabbits, mice, rats, sheep, goats, etc. In one embodiment, the polypeptide
according to the invention or fragment thereof can be conjugated to an
immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole limpet
hemocyanin (KLH). Various adjuvants may be used to increase the immunological
response, depending on the host species, including but not limited to Freund's
(complete and incomplete), mineral gels such as aluminum hydroxide, surface
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil
emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium
parvum.

For preparation of monoclonal antibodies directed toward a polypeptide
according to the invention, or fragment, analog, or derivative thereof, any
technique that provides for the production of antibody molecules by continuous
cell lines in culture may be used. These include but are not limited to the
hybridoma technique originally developed by Kohler and Milstein [Nature
256:495-497 (1975)], as well as the trioma technique, the human B-cell hybridoma
technique [Kozbor et al., Immunology Today 4:72 (1983); Cote et al., Proc. Natl.
Acad. Sci. U.S.A. 80:2026-2030 (1983)], and the EBV-hybridoma technique to
produce human monoclonal antibodies [Cole et al., in Monoclonal Antibodies and
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)]. In an additional
embodiment of the invention, monoclonal antibodies can be produced in germ-free
animals [International Patent Publication No. WO 89/12690, published 28
December 1989]. In fact, according to the invention, techniques developed for the
production of "chimeric antibodies" [Morrison et al., J. Bacteriol. 159:870 (1984);
Neuberger et al., Nature 312:604-608 (1984); Takeda et al., Nature 314:452-454
(1985)] by splicing the genes from a mouse antibody molecule specific for a
polypeptide according to the invention together with genes from a human antibody
molecule of appropriate biological activity can be used; such antibodies are within
the scope of this invention. Such human or humanized chimeric antibodies are
preferred for use in therapy of human diseases or disorders (described infra), since
the human or humanized antibodies are much less likely than xenogenic antibodies
to induce an immune response, in particular an allergic response, themselves.

According to the invention, techniques described for the production of
single chain Fv (scFv) antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to
Huston; U.S. Patent 4,946,778] can be adapted to produce polypeptide-specific
single chain antibodies. An additional embodiment of the invention utilizes the
techniques described for the construction of Fab expression libraries [Huse et al.,
Science 246:1275-1281 (1989)] to allow rapid and easy identification of
monoclonal Fab fragments with the desired specificity for a polypeptide according
to the invention, or its derivatives, or analogs.

Antibody fragments that contain the idiotype of the antibody molecule can
be generated by known techniques. For example, such fragments include but are
not limited to: the F(ab')2 fragment which can be produced by pepsin digestion of
the antibody molecule; the Fab' fragments which can be generated by reducing the
disulfide bridges of the F(ab')2 fragment, and the Fab fragments which can be
generated by treating the antibody molecule with papain and a reducing agent.

In the production of antibodies, screening for the desired antibody can be
accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA
(enzyme-linked immunosorbent assay), "sandwich" immunoassays,
immunoradiometric assays, gel diffusion precipitin reactions, immunodiffusion
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels,
for example), western blots, precipitation reactions, agglutination assays (e.g., gel
agglutination assays, hemagglutination assays), complement fixation assays,
immunofluorescence assays, protein A assays, and immunoelectrophoresis assays,
etc. In one embodiment, antibody binding is detected by detecting a label on the
primary antibody. In another embodiment, the primary antibody is detected by
detecting binding of a secondary antibody or reagent to the primary antibody. In a
further embodiment, the secondary antibody is labeled. Many means are known in
the art for detecting binding in an immunoassay and are within the scope of the
present invention. For example, to select antibodies that recognize a specific
epitope of a polypeptide according to the invention, one may assay generated
hybridomas for a product that binds to a polypeptide fragment comprising such an
epitope. For selection of an antibody specific to a polypeptide of the invention
from a particular species of animal, one can select on the basis of positive binding
with the polypeptide expressed by or isolated from cells of that species of animal.

The foregoing antibodies can be used in methods known in the art relating 
to the localization and activity of a polypeptide of the invention, e.g., for Western 
blotting, imaging the polypeptide in situ, measuring levels thereof in appropriate 
physiological samples, etc., using any of the detection techniques mentioned above 
or known in the art.

In a specific embodiment, antibodies that agonize or antagonize the activity 
of a Relax or hngn3 polypeptide can be generated. Such antibodies can be tested 
using the assays for identifying ligands. In particular, such antibodies can be scFv 
antibodies expressed intracellularly. 

The present invention also relates to an antibody directed against

1) a polypeptide comprising an amino acid sequence of either one of
SEQ ID NOs: 8 or 10,

2) a polypeptide fragment or variant of a polypeptide comprising an amino
acid sequence of either one of SEQ ID NOs: 8 or 10, or

3) a polypeptide termed "homologous" to a polypeptide comprising either
one of SEQ ID NOs: 8 or 10,

as produced by the trioma technique or the hybridoma technique described by
Kozbor et al.

The invention also relates to single-chain Fv antibody fragments (scFv) as
described in US patent No. 4,946,778 or by Martineau et al. (J Mol Biol,
280(1):117-127, 1998).

The antibodies according to the invention also comprise antibody fragments
obtained with the aid of phage libraries (Martineau et al., Ibid.) or humanized
antibodies [Reimann et al. (AIDS Res Hum Retroviruses, 13(11):933-943, 1997)
and Leger et al. (Hum Antibodies, 8(1):3-16, 1997)].

The antibody preparations according to the invention are useful in
immunological detection tests intended for the identification of the presence and/or
of the quantity of antigens present in a sample.

An antibody according to the invention may comprise, in addition, a
detectable marker that is isotopic or nonisotopic, for example fluorescent, or may
be coupled to a molecule such as biotin, according to techniques well known to
persons skilled in the art.

Thus, another subject of the invention is a method of detecting the presence
of a polypeptide according to the invention in a sample, said method comprising
the steps of:

a) bringing the sample to be tested into contact with an antibody directed
against

1) a polypeptide comprising an amino acid sequence of either one of
SEQ ID NOs: 8 or 10,

2) a polypeptide fragment or variant of a polypeptide comprising an
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or

3) a polypeptide termed "homologous" to a polypeptide comprising
either one of SEQ ID NOs: 8 or 10, and

b) detecting the antigen/antibody complex formed.

The invention also relates to a box or kit for diagnosis or for detecting the
presence of a polypeptide in accordance with the invention in a sample, said box
comprising:

a) an antibody directed against

1) a polypeptide comprising an amino acid sequence of either one of
SEQ ID NOs: 8 or 10,

2) a polypeptide fragment or variant of a polypeptide comprising an
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or

3) a polypeptide termed "homologous" to a polypeptide comprising
either one of SEQ ID NOs: 8 or 10,

and

b) a reagent allowing the detection of the antigen/antibody complex
formed.




Gene Therapy and Transgenic Vectors 
The invention also relates to a recombinant vector comprising a nucleic
acid according to the invention. "Vector" for the purposes of the present invention
will be understood to mean a circular or linear DNA or RNA molecule that is in
either single-stranded or double-stranded form.

Preferably, such a recombinant vector will comprise a nucleic acid selected
from the group consisting of

a) a nucleic acid comprising a polynucleotide sequence of either one of
SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence,

b) a nucleic acid comprising a polynucleotide sequence as depicted in either
one of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence,

c) a nucleic acid having at least eight consecutive nucleotides of a nucleic
acid comprising a polynucleotide sequence of 1) SEQ ID NO: 1, or of a
complementary polynucleotide sequence, or 2) SEQ ID NO: 9, or of a
complementary polynucleotide sequence,

d) a nucleic acid having at least 80% nucleotide identity with a nucleic acid
comprising a polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or of a
complementary polynucleotide sequence,

e) a nucleic acid having 85%, 90%, 95%, or 98% nucleotide identity with a
nucleic acid comprising a polynucleotide sequence of either one of SEQ ID NOs: 1
or 9, or of a complementary polynucleotide sequence,

f) a nucleic acid hybridizing, under high stringency hybridization
conditions, with a nucleic acid comprising a polynucleotide sequence of either one
of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence, and

g) a nucleic acid encoding a polypeptide comprising an amino acid
sequence of either one of SEQ ID NOs: 8 or 10.

According to a first embodiment, a recombinant vector according to the
invention is used to amplify a nucleic acid inserted therein, following
transformation or transfection of a desired cellular host.
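
By way of illustration only, criteria c) and d) of the list of nucleic acids above can be sketched in a few lines of Python. The sketch assumes that the two sequences compared have already been aligned to equal length without gaps, which is a simplification and not necessarily the identity calculation intended by the invention. The function names and the toy 20-nucleotide sequences are hypothetical and are not taken from SEQ ID NOs: 1 or 9.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions carrying the same nucleotide.

    Assumes the two sequences are pre-aligned, ungapped and of equal length.
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def shares_consecutive_run(query: str, reference: str, k: int = 8) -> bool:
    """True if any k consecutive nucleotides of `query` occur in `reference`."""
    query, reference = query.upper(), reference.upper()
    return any(query[i:i + k] in reference for i in range(len(query) - k + 1))


# Hypothetical toy sequences, used only to exercise the two functions.
reference = "ATGACGCCTCAACCCTCGGG"
candidate = "ATGACGCCTCAACCCTCGAA"  # differs at the last two positions

identity = percent_identity(reference, candidate)   # 18 of 20 positions match
meets_80 = identity >= 80.0                          # criterion d)
has_8mer = shares_consecutive_run(candidate, reference)  # criterion c)
```

In practice the identity percentage would be computed after an alignment step, and the thresholds of criterion e) (85%, 90%, 95%, 98%) could be tested in the same way.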

According to a second embodiment, a recombinant vector according to the
invention corresponds to an expression vector comprising, in addition to a nucleic
acid in accordance with the invention, a regulatory signal or nucleotide sequence
that directs or controls transcription and/or translation of the nucleic acid and its
encoded mRNA.

According to a preferred embodiment, a recombinant vector according to
the invention will comprise in particular the following components:

(1) an element or signal for regulating the expression of the nucleic acid to
be inserted, such as a promoter and/or enhancer sequence;

(2) a nucleotide coding region comprised within the nucleic acid in
accordance with the invention to be inserted into such a vector, said coding region
being placed in phase with the regulatory element or signal described in (1); and

(3) an appropriate nucleic acid for initiation and termination of transcription
of the nucleotide coding region of the nucleic acid described in (2).

In addition, the recombinant vectors according to the invention may include
one or more origins for replication in the cellular hosts in which their amplification
or their expression is sought, as well as markers or selectable markers.

The present invention also includes gene therapy by the administration to a
patient of a nucleic acid encoding a polypeptide according to the invention.
Preferably, the nucleic acid encodes a human ngn3 protein.

The nucleic acids of the invention, where appropriate incorporated in
vectors, and the pharmaceutical compositions comprising them, may be used for
the treatment of many pathologies or dysfunctions. They may be used for the
transfer and expression of genes in vivo in any type of tissue, especially the nervous
system. The treatment can, moreover, be targeted in accordance with the pathology
or dysfunction to be treated (transfer to a particular tissue can, in particular, be
determined by the choice of a vector, and expression by the choice of a particular
promoter). The nucleic acids or vectors of the invention are advantageously used
for the production in humans or animals, in vivo and intracellularly, of proteins
capable of acting specifically on various cell functions such as cell proliferation,
cell differentiation, neurogenesis, and the like. The present invention thus makes it
possible to treat specifically, locally and effectively cell dysfunctions at the origin
of or resulting from different pathologies affecting neurogenesis.

As discussed above, a "vector" is any means for the transfer of a nucleic 
acid according to the invention into a host cell. Preferred vectors are viral vectors, 
such as retroviruses, herpes viruses, adenoviruses, and adeno-associated viruses. 
Thus, a gene encoding a polypeptide according to the invention or a fragment 
thereof is introduced in vivo, ex vivo, or in vitro using a viral vector or through
direct introduction of DNA. Expression in targeted tissues can be effected by 
targeting the transgenic vector to specific cells, such as with a viral vector or a 
receptor ligand, or by using a tissue-specific promoter, or both. 

Expression vectors of the invention can be used, as pointed out above, both
to transfect cells for screening or biological testing of modulators of the
polypeptides according to the invention, and for delivery of a gene or antisense gene
according to the invention in vivo or ex vivo for gene therapy, e.g., to increase or
decrease the level of activity of a polypeptide of the invention. A vector that
expresses a scFv of the invention can also be introduced using the techniques
discussed below.

Viral vectors commonly used for in vivo or ex vivo targeting and therapy
procedures are DNA-based vectors and retroviral vectors. Methods for
constructing and using viral vectors are known in the art [see, e.g., Miller and
Rosman, BioTechniques 7:980-990 (1992)]. Preferably, the viral vectors are
replication defective, that is, they are unable to replicate autonomously in the target
cell. In general, the genome of the replication defective viral vectors which are
used within the scope of the present invention lacks at least one region which is
necessary for the replication of the virus in the infected cell. These regions can
either be eliminated (in whole or in part) or be rendered non-functional by any
technique known to a person skilled in the art. These techniques include the total
removal, substitution (by other sequences, in particular by the inserted nucleic
acid), partial deletion or addition of one or more bases to an essential (for
replication) region. Such techniques may be performed in vitro (on the isolated
DNA) or in situ, using the techniques of genetic manipulation or by treatment with
mutagenic agents. Preferably, the replication defective virus retains the sequences
of its genome that are necessary for encapsidating the viral particles.

DNA viral vectors include an attenuated or defective DNA virus, such as 
but not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus 
(EBV), adenovirus, adeno-associated virus (AAV), vaccinia virus, and the like. 

Vectors derived from adenoviruses, retroviruses or AAVs incorporating
heterologous nucleic acid sequences have been described in the literature [Akli et
al., Nature Genetics 3 (1993) 224; Stratford-Perricaudet et al., Human Gene
Therapy 1 (1990) 241; EP 185 573; Levrero et al., Gene 101 (1991) 195; Le Gal La
Salle et al., Science 259 (1993) 988; Roemer and Friedmann, Eur. J. Biochem. 208
(1992) 211; Dobson et al., Neuron 5 (1990) 353; Chiocca et al., New Biol. 2 (1990)
739; Miyanohara et al., New Biol. 4 (1992) 238; WO91/18088].

The present invention therefore also relates to any recombinant virus 
comprising, inserted into its genome, a nucleic acid according to the invention. 

Defective viruses, which entirely or almost entirely lack viral genes, are
preferred. The term "defective virus" designates a virus incapable of replicating in
the target cell. Generally, the genome of the defective viruses used in the context of
the present invention therefore lacks at least the sequences necessary for the
replication of the virus in the infected cell. These regions may be either removed
(completely or partly), or made nonfunctional, or substituted by other sequences and
in particular by the nucleic acid of the invention. Preferably, the defective virus
conserves, nevertheless, the sequences of its genome that are necessary for the
encapsidation of the viral particles. A defective virus is not replication competent
after introduction into a cell, and thus does not lead to a productive viral infection.
Use of defective viral vectors allows for administration to cells in a specific,
localized area, without concern that the vector can infect other cells. Thus, a
specific tissue can be specifically targeted. Examples of particular vectors include,
but are not limited to, a defective herpes virus 1 (HSV1) vector [Kaplitt et al.,
Molec. Cell. Neurosci. 2:320-330 (1991)], a defective herpes virus vector lacking a
glycoprotein L gene [Patent Publication RD 371005 A], or other defective herpes
virus vectors [International Patent Publication No. WO 94/21807, published
September 29, 1994; International Patent Publication No. WO 92/05263, published
April 2, 1994]; an attenuated adenovirus vector, such as the vector described by
Stratford-Perricaudet et al. [J. Clin. Invest. 90:626-630 (1992); see also Le Gal La
Salle et al., Science 259:988-990 (1993)]; and a defective adeno-associated virus
vector [Samulski et al., J. Virol. 61:3096-3101 (1987); Samulski et al., J. Virol.
63:3822-3828 (1989); Lebkowski et al., Mol. Cell. Biol. 8:3988-3996 (1988)].

Preferably, for in vivo administration, an appropriate immunosuppressive
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector,
to avoid immuno-deactivation of the viral vector and transfected cells. For
example, immunosuppressive cytokines, such as interleukin-12 (IL-12), interferon-γ
(IFN-γ), or anti-CD4 antibody, can be administered to block humoral or cellular
immune responses to the viral vectors [see, e.g., Wilson, Nature Medicine (1995)].
In addition, it is advantageous to employ a viral vector that is engineered to
express a minimal number of antigens.

Naturally, the invention contemplates delivery of a vector that will express
a therapeutically effective amount of a polypeptide of the invention for gene
therapy applications. The phrase "therapeutically effective amount" is used herein
to mean an amount sufficient to reduce by at least about 15 percent, preferably by
at least 50 percent, more preferably by at least 90 percent, and most preferably
prevent, a clinically significant deficit in the activity, function and response of the
host. Alternatively, a therapeutically effective amount is sufficient to cause an
improvement in a clinically significant condition in the host.

Adenovirus vectors 
In a preferred embodiment, the vector is an adenovirus vector.
Adenoviruses are eukaryotic DNA viruses that can be modified to efficiently
deliver a nucleic acid of the invention to a variety of cell types. Various serotypes
of adenovirus exist, whose structure and properties vary somewhat. Of these
serotypes, preference is given, within the scope of the present invention, to using
type 2 or type 5 human adenoviruses (Ad 2 or Ad 5) or adenoviruses of animal
origin (see WO94/26914). Those adenoviruses of animal origin that can be used
within the scope of the present invention include adenoviruses of canine, bovine,
murine (example: Mav1, Beard et al., Virology 75 (1990) 81), ovine, porcine,
avian, and simian (example: SAV) origin. Preferably, the adenovirus of animal
origin is a canine adenovirus, more preferably a CAV2 adenovirus (e.g., Manhattan
or A26/61 strain (ATCC VR-800)). Preferably, adenoviruses of
human or canine or mixed origin are used in the context of the invention.

Preferably, the replication defective adenoviral vectors of the invention
comprise the ITRs, an encapsidation sequence and the nucleic acid of interest. Still
more preferably, at least the E1 region of the adenoviral vector is non-functional.
The deletion in the E1 region preferably extends from nucleotides 455 to 3329 in
the sequence of the Ad5 adenovirus (PvuII-BglII fragment) or 382 to 3446
(HinfII-Sau3A fragment). Other regions may also be modified, in particular the E3
region (WO95/02697), the E2 region (WO94/28938), the E4 region (WO94/28152,
WO94/12649 and WO95/02697), or any of the late genes L1-L5. The viral gene
considered can be made nonfunctional by any technique known to persons skilled
in the art, and in particular by total suppression, by substitution or partial deletion
or by addition of one or more bases in the gene(s) considered.
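
As a simple worked illustration, the size of each E1 deletion quoted above can be computed from its Ad5 coordinates. The sketch below assumes that both end coordinates are inclusive, which the text does not state; the dictionary and function names are hypothetical.

```python
# Ad5 coordinates of the two E1 deletions quoted in the text.
# Treating both ends as inclusive is an assumption of this sketch.
E1_DELETIONS = {
    "PvuII-BglII": (455, 3329),
    "HinfII-Sau3A": (382, 3446),
}


def deletion_length(start: int, end: int) -> int:
    """Number of nucleotides removed when positions start..end are deleted."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


lengths = {name: deletion_length(*coords) for name, coords in E1_DELETIONS.items()}
```

Under this assumption the two deletions remove 2875 and 3065 nucleotides respectively.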

In a preferred embodiment, the adenoviral vector has a deletion in the E1
region (Ad 1.0). Examples of E1-deleted adenoviruses are disclosed in EP
185,573, the contents of which are incorporated herein by reference. In another
preferred embodiment, the adenoviral vector has a deletion in the E1 and E4
regions (Ad 3.0). Examples of E1/E4-deleted adenoviruses are disclosed in
WO95/02697 and WO96/22378, the contents of which are incorporated herein by
reference. In still another preferred embodiment, the adenoviral vector has a
deletion in the E1 region into which the E4 region and the nucleic acid sequence of
interest are inserted (see FR94 13355, the contents of which are incorporated herein
by reference). According to another preferred embodiment, the exogenous nucleic
acid sequence is inserted at the level of the deletion in the E1 region.

The replication defective recombinant viruses of the invention can be
prepared by any technique known to the person skilled in the art (Levrero et al.,
Gene 101 (1991) 195; EP 185 573; Graham, EMBO J. 3 (1984) 2917). In
particular, they can be prepared by homologous recombination between a defective
adenovirus and a plasmid that carries, inter alia, the DNA sequence of interest.
The homologous recombination occurs after co-transfection of the adenovirus and
plasmid into an appropriate cell line. The cell line which is employed should
preferably (i) be transformable by the said elements, and (ii) comprise the nucleic
acid sequences that are able to complement the defective virus genome part,
preferably in integrated form in order to avoid the risks of recombination.

Examples of cell lines which may be used for the preparation of defective
recombinant adenoviruses are the human embryonic kidney cell line 293 [Graham
et al., J. Gen. Virol. 36 (1977) 59], which contains the left-hand portion of the
genome of an Ad5 adenovirus (12%) integrated into its genome, and cell lines
which are able to complement the E1 and E4 functions, as described in applications
WO94/26914 and WO95/02697. Recombinant adenoviruses are recovered and
purified using standard molecular biological techniques, which are well known to
one of ordinary skill in the art.

Adeno-associated virus vectors 
The adeno-associated viruses (AAV) are DNA viruses of relatively small
size that can integrate, in a stable and site-specific manner, into the genome of the
cells which they infect. They are able to infect a wide spectrum of cells without
inducing any effects on cellular growth, morphology or differentiation, and they do
not appear to be involved in human pathologies. The AAV genome has been
cloned, sequenced and characterised. It encompasses approximately 4700 bases
and contains an inverted terminal repeat (ITR) region of approximately 145 bases
at each end, which serves as an origin of replication for the virus. The remainder
of the genome is divided into two essential regions which carry the encapsidation
functions: the left-hand part of the genome, which contains the rep gene involved
in viral replication and expression of the viral genes; and the right-hand part of the
genome, which contains the cap gene encoding the capsid proteins of the virus.
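
The approximate figures given above allow a quick estimate of the portion of the AAV genome lying between the two ITRs, that is, the region carrying the rep and cap genes. The following sketch uses only the two approximate values stated in the text; the constant and function names are hypothetical, and the result is an order-of-magnitude estimate rather than a measured value.

```python
AAV_GENOME_NT = 4700  # approximate genome length stated in the text
ITR_NT = 145          # approximate length of each ITR stated in the text


def internal_region_length(genome_nt: int, itr_nt: int) -> int:
    """Length of the region lying between the two terminal repeats."""
    internal = genome_nt - 2 * itr_nt
    if internal < 0:
        raise ValueError("ITRs cannot be longer than the genome")
    return internal


internal_nt = internal_region_length(AAV_GENOME_NT, ITR_NT)
```

With these values, roughly 4410 bases lie between the two ITRs.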

The use of vectors derived from the AAVs for transferring genes in vitro
and in vivo has been described (see WO 91/18088; WO 93/09239; US 4,797,368;
US 5,139,941; EP 488 528). These publications describe various AAV-derived
constructs in which the rep and/or cap genes are deleted and replaced by a gene of
interest, and the use of these constructs for transferring the said gene of interest in
vitro (into cultured cells) or in vivo (directly into an organism). The replication
defective recombinant AAVs according to the invention can be prepared by
cotransfecting a plasmid containing the nucleic acid sequence of interest flanked by
two AAV inverted terminal repeat (ITR) regions, and a plasmid carrying the AAV
encapsidation genes (rep and cap genes), into a cell line which is infected with a
human helper virus (for example an adenovirus). The AAV recombinants that are
produced are then purified by standard techniques.

The invention also relates, therefore, to an AAV-derived recombinant virus
whose genome encompasses a nucleic acid sequence encoding a
polypeptide of the invention flanked by the AAV ITRs. The invention also relates
to a plasmid encompassing a nucleic acid sequence encoding a
polypeptide of the invention flanked by two ITRs from an AAV. Such a plasmid
can be used as it is for transferring the nucleic acid sequence, with the plasmid,
where appropriate, being incorporated into a liposomal vector (pseudo-virus).

Retrovirus vectors 
In another embodiment, the gene can be introduced in a retroviral vector,
e.g., as described in Anderson et al., U.S. Patent No. 5,399,346; Mann et al., 1983,
Cell 33:153; Temin et al., U.S. Patent No. 4,650,764; Temin et al., U.S. Patent No.
4,980,289; Markowitz et al., 1988, J. Virol. 62:1120; Temin et al., U.S. Patent No.
5,124,263; EP 453242; EP 178220; Bernstein et al., Genet. Eng. 7 (1985) 235;
McCormick, BioTechnology 3 (1985) 689; International Patent Publication No.
WO 95/07358, published March 16, 1995, by Dougherty et al.; and Kuo et al.,
1993, Blood 82:845. The retroviruses are integrating viruses that infect dividing
cells. The retrovirus genome includes two LTRs, an encapsidation sequence and
three coding regions (gag, pol and env). In recombinant retroviral vectors, the gag,
pol and env genes are generally deleted, in whole or in part, and replaced with a
heterologous nucleic acid sequence of interest. These vectors can be constructed
from different types of retrovirus, such as HIV, MoMuLV ("murine Moloney
leukaemia virus"), MSV ("murine Moloney sarcoma virus"), HaSV ("Harvey
sarcoma virus"), SNV ("spleen necrosis virus"), RSV ("Rous sarcoma virus") and
Friend virus. Defective retroviral vectors are disclosed in WO95/02697.

In general, in order to construct recombinant retroviruses containing a
nucleic acid sequence, a plasmid is constructed which contains the LTRs, the
encapsidation sequence and the coding sequence. This construct is used to
transfect a packaging cell line, which cell line is able to supply in trans the
retroviral functions that are deficient in the plasmid. In general, the packaging cell
lines are thus able to express the gag, pol and env genes. Such packaging cell lines
have been described in the prior art, in particular the cell line PA317
(US4,861,719); the PsiCRIP cell line (WO90/02806) and the GP+envAm-12 cell
line (WO89/07150). By way of example of a line which can be used for the
preparation of defective recombinant retroviruses, there may be mentioned the
CRIP line (Danos and Mulligan, PNAS 85 (1988) 6460). Next, the viruses that
have multiplied are recovered and purified according to conventional molecular
biology techniques. In addition, the recombinant retroviral vectors can contain
modifications within the LTRs for suppressing transcriptional activity as well as
extensive encapsidation sequences which may include a part of the gag gene
(Bender et al., J. Virol. 61 (1987) 1639). Recombinant retroviral vectors are
purified by standard techniques known to those having ordinary skill in the art.
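
The complementation in trans described above can be pictured with a small bookkeeping sketch: the vector plasmid carries the LTRs, the encapsidation sequence and the coding sequence, while the packaging cell line supplies gag, pol and env. This is only a schematic model under stated assumptions. The set of required functions, the label "psi" for the encapsidation sequence and all names in the code are hypothetical choices made for the illustration.

```python
# Functions assumed necessary to produce a recombinant retroviral particle.
# "psi" is a label chosen here for the encapsidation sequence (an assumption).
REQUIRED = frozenset({"LTR", "psi", "gag", "pol", "env"})


def missing_functions(plasmid_elements, packaging_cell_genes):
    """Return required functions supplied by neither the plasmid nor the cell line."""
    supplied = set(plasmid_elements) | set(packaging_cell_genes)
    return REQUIRED - supplied


# Vector plasmid as described in the text: LTRs, encapsidation sequence, coding sequence.
vector_plasmid = {"LTR", "psi", "coding_sequence"}
# Packaging cell line expressing gag, pol and env in trans.
packaging_line = {"gag", "pol", "env"}

complete = missing_functions(vector_plasmid, packaging_line)  # nothing is missing
without_helper = missing_functions(vector_plasmid, set())     # gag, pol and env are missing
```

The second call shows why the plasmid alone cannot yield particles: the functions deleted from the vector must come from the packaging cell line.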

Retroviral vectors can be constructed to function as infectious particles or
to undergo a single round of transfection. In the former case, the virus is modified
to retain all of its genes except for those responsible for oncogenic transformation
properties, and to express the heterologous gene. Non-infectious viral vectors are
prepared to destroy the viral packaging signal, but retain the structural genes
required to package the co-introduced virus engineered to contain the heterologous
gene and the packaging signals. Thus, the viral particles that are produced are not
capable of producing additional virus.

Targeted gene delivery is described in International Patent Publication WO
95/28494, published October 1995.

Non-viral vectors

Alternatively, the vector can be introduced in vivo by lipofection. For the
past decade, there has been increasing use of liposomes for encapsulation and
transfection of nucleic acids in vitro. Synthetic cationic lipids designed to limit the
difficulties and dangers encountered with liposome mediated transfection can be
used to prepare liposomes for in vivo transfection of a gene encoding a marker
[Felgner et al., Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417 (1987); see Mackey et
al., Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031 (1988); Ulmer et al., Science
259:1745-1748 (1993)]. The use of cationic lipids may promote encapsulation of
negatively charged nucleic acids, and also promote fusion with negatively charged
cell membranes [Felgner and Ringold, Science 337:387-388 (1989)]. Particularly
useful lipid compounds and compositions for transfer of nucleic acids are described
in International Patent Publications WO95/18863 and WO96/17823, and in U.S.
Patent No. 5,459,127. The use of lipofection to introduce exogenous genes into
specific organs in vivo has certain practical advantages. Molecular targeting of
liposomes to specific cells represents one area of benefit. It is clear that directing
transfection to particular cell types would be particularly advantageous in a tissue
with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids
may be chemically coupled to other molecules for the purpose of targeting [see
Mackey et al., supra]. Targeted peptides, e.g., hormones or neurotransmitters,
and proteins such as antibodies, or non-peptide molecules could be coupled to
liposomes chemically.

Other molecules are also useful for facilitating transfection of a nucleic acid
in vivo, such as a cationic oligopeptide (e.g., International Patent Publication
WO95/21931), peptides derived from DNA binding proteins (e.g., International
Patent Publication WO96/25508), or a cationic polymer (e.g., International Patent
Publication WO95/21931).

It is also possible to introduce the vector in vivo as a naked DNA plasmid
(see U.S. Patents 5,693,622, 5,589,466 and 5,580,859). Naked DNA vectors for
gene therapy can be introduced into the desired host cells by methods known in the
art, e.g., transfection, electroporation, microinjection, transduction, cell fusion,
DEAE dextran, calcium phosphate precipitation, use of a gene gun, or use of a
DNA vector transporter [see, e.g., Wu et al., J. Biol. Chem. 267:963-967 (1992);
Wu and Wu, J. Biol. Chem. 263:14621-14624 (1988); Hartmut et al., Canadian
Patent Application No. 2,012,311, filed March 15, 1990; Williams et al., Proc.
Natl. Acad. Sci. USA 88:2726-2730 (1991)]. Receptor-mediated DNA delivery
approaches can also be used [Curiel et al., Hum. Gene Ther. 3:147-154 (1992); Wu
and Wu, J. Biol. Chem. 262:4429-4432 (1987)]. Preferred naked DNA vectors
include pCOR plasmids having a conditional origin of replication (see
WO97/10343), and minicircle plasmids lacking an origin of replication and a
marker gene (see WO96/26270).

Recombinant Host Cells of the Invention 
The present invention also relates to the use of cells genetically modified
ex vivo with a virus according to the invention, or of cells producing such viruses,
implanted in the body, allowing a prolonged and effective expression in vivo of a
biologically active polypeptide according to the invention.

The present invention shows that it is possible to incorporate a nucleic
acid encoding a polypeptide according to the invention into a viral vector, and that
these vectors make it possible to effectively express a biologically active, mature
polypeptide. More particularly, the invention shows that the in vivo expression of a
polypeptide according to the invention may be obtained by direct administration of
an adenovirus or by implantation of a producing cell or of a cell genetically
modified by an adenovirus or by a retrovirus incorporating such a nucleic acid.

In this regard, another subject of the invention relates to any isolated
mammalian cell infected with one or more defective recombinant viruses according
to the invention. More particularly, the invention relates to any population of
human cells infected with these viruses. These may be in particular cells of blood
origin (totipotent stem cells or precursors), fibroblasts, myoblasts, hepatocytes,
keratinocytes, smooth muscle and endothelial cells, glial cells and the like.

Another subject of the invention relates to an implant comprising isolated
mammalian cells infected with one or more defective recombinant viruses
according to the invention or cells producing recombinant viruses, and an
extracellular matrix. Preferably, the implants according to the invention comprise
10^ to 10^10 cells. More preferably, they comprise 10^ to 10^ cells.

More particularly, in the implants of the invention, the extracellular matrix 
comprises a gelling compound and optionally, a support allowing the anchorage of 
the cells. 

The invention also relates to an isolated recombinant host cell comprising
a nucleic acid of the invention, and more particularly, a nucleic acid comprising
either one of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence.

The invention also relates to an isolated recombinant host cell comprising a 
nucleic acid of the invention, and more particularly a nucleic acid comprising a 
nucleotide sequence as depicted in either one of SEQ ID NOs: 1 or 9, or of a
complementary polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
nucleic acid encoding a polypeptide comprising an amino acid sequence of either 
one of SEQ ID NOs: 8 or 10. 

According to another aspect, the invention also relates to an isolated
recombinant host cell comprising a recombinant vector according to the invention.
Therefore, the invention also relates to a recombinant host cell comprising a
recombinant vector comprising any of the nucleic acids of the invention.

Specifically, the invention relates to an isolated recombinant host cell 
comprising a recombinant vector comprising a nucleic acid comprising either one
of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
recombinant vector comprising a nucleic acid comprising a polynucleotide 
sequence as depicted in either one of SEQ ID NOs: 1 or 9, or of a complementary 
polynucleotide sequence.

The invention also relates to an isolated recombinant host cell comprising a 
recombinant vector comprising a nucleic acid encoding a polypeptide comprising 
an amino acid sequence of either one of SEQ ID NOs: 8 or 10. 

The preferred host cells according to the invention are for example the
following:

a) prokaryotic host cells: strains of Escherichia coli (strain DH5-α), of
Bacillus subtilis, of Salmonella typhimurium, or strains of genera such as
Pseudomonas, Streptomyces and Staphylococcus;

b) eukaryotic host cells: HeLa cells (ATCC No. CCL2), CV-1 cells (ATCC
No. CCL70), COS cells (ATCC No. CRL 1650), Sf-9 cells (ATCC No. CRL
1711), CHO cells (ATCC No. CCL-61) or 3T3 cells (ATCC No. CRL-6361).



Pharmaceutical Compositions and Delivery 
The present invention also relates to pharmaceutical compositions. Such 
compositions may comprise a polypeptide, nucleic acid, recombinant vector,
recombinant virus, recombinant host cell, or implant of the invention, as defined 
above, and a pharmaceutically acceptable excipient, carrier or vehicle. The 
compositions of the invention are particularly suitable for formulation of biological 
material for gene therapy. Thus, in a preferred embodiment, the composition
comprises a nucleic acid encoding a human ngn3 protein or polypeptide
comprising amino acid sequence SEQ ID NO: 10.

Any vector, viral or non-viral, of the invention will preferably be introduced
in vivo in a pharmaceutically acceptable excipient, vehicle or carrier. The phrase
"pharmaceutically acceptable" refers to molecular entities and compositions that
are physiologically tolerable and do not typically produce an allergic or similar
untoward reaction, such as gastric upset, dizziness and the like, when administered
to a human. Preferably, as used herein, the term "pharmaceutically acceptable"
means approved by a regulatory agency of the Federal or a state government or
listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use
in animals, and more particularly in humans. The term "carrier" refers to a diluent,
adjuvant, excipient, or vehicle with which the compound is administered. Such 
pharmaceutical carriers can be sterile liquids, such as water and oils, including 
those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, 
soybean oil, mineral oil, sesame oil and the like. Water or aqueous saline
solutions and aqueous dextrose and glycerol solutions are preferably employed as
carriers, particularly for injectable solutions. Suitable pharmaceutical carriers are
described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

The pharmaceutical compositions of the invention may be formulated for 
the purpose of topical, oral, parenteral, intranasal, intravenous, intramuscular,
subcutaneous, intraocular, and the like, administration.

Preferably, the pharmaceutical compositions contain pharmaceutically
acceptable vehicles for an injectable formulation. These can be, in particular, 
sterile, isotonic saline solutions (monosodium or disodium phosphate, sodium, 
potassium, calcium or magnesium chloride, and the like, or mixtures of such salts), 
or dry, in particular lyophilized, compositions which, on addition, as appropriate,
of sterilized water or of physiological saline, enable injectable solutions to be 
formed. 

The compositions may be in particular isotonic, sterile, saline solutions 
(monosodium or disodium phosphate, sodium, potassium, calcium or magnesium
chloride and the like or mixtures of such salts), or dry, especially freeze-dried
compositions that upon addition, depending on the case, of sterilized water or 
physiological saline, allow the constitution of injectable solutions. 

The preferred sterile injectable preparations can be a solution or suspension 
in a nontoxic parenterally acceptable solvent or diluent. Examples of
pharmaceutically acceptable carriers or vehicles are saline, buffered saline, isotonic
saline (e.g., monosodium or disodium phosphate, sodium, potassium, calcium or
magnesium chloride, or mixtures of such salts), Ringer's solution, dextrose, water,
sterile water, glycerol, ethanol, and combinations thereof. 1,3-butanediol and
sterile fixed oils are conveniently employed as solvents or suspending media. Any
bland fixed oil can be employed including synthetic mono- or di-glycerides. Fatty
acids such as oleic acid also find use in the preparation of injectables.

The doses of nucleic acids of the invention, either alone or incorporated in a 
vector, used for administration can be adjusted in accordance with different 
parameters, and in particular in accordance with the mode of administration used,
the pathology in question, the gene to be expressed or the desired treatment period.

Generally speaking, in the case of the recombinant viruses according to the
invention, these are formulated and administered in the form of doses of between
10^4 and 10^14 pfu, and preferably 10^ to 10^10 pfu. The term pfu (plaque forming
unit) corresponds to the infectious power of a solution of virus, and is determined
by infection of a suitable cell culture and measurement, generally after 48 hours, of
the number of infected cell plaques. The techniques of determination of the pfu
titre of a viral solution are well documented in the literature. 
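
By way of illustration only, the relationship between a plaque count and a pfu titre can be sketched in Python using the conventional plaque-assay formula (plaques counted, divided by the product of the dilution and the volume plated). The function names and all numerical inputs below are hypothetical and are not taken from the present description.

```python
def pfu_per_ml(plaque_count: int, dilution: float, inoculum_ml: float) -> float:
    """Return a titre in pfu/ml from a plaque count.

    dilution    -- fraction of the stock present in the plated sample (e.g. 1e-6)
    inoculum_ml -- volume of diluted sample plated, in ml
    All inputs are illustrative assumptions.
    """
    if plaque_count < 0 or dilution <= 0 or inoculum_ml <= 0:
        raise ValueError("count must be >= 0; dilution and volume must be > 0")
    return plaque_count / (dilution * inoculum_ml)


def volume_for_dose(target_pfu: float, titre_pfu_per_ml: float) -> float:
    """Return the volume of stock (ml) containing target_pfu plaque forming units."""
    if titre_pfu_per_ml <= 0:
        raise ValueError("titre must be > 0")
    return target_pfu / titre_pfu_per_ml


if __name__ == "__main__":
    # Hypothetical assay: 50 plaques from 0.1 ml of a 10^-6 dilution.
    titre = pfu_per_ml(50, 1e-6, 0.1)
    print(f"titre: {titre:.2e} pfu/ml")
    print(f"volume for 1e9 pfu: {volume_for_dose(1e9, titre):.2f} ml")
```

The second helper simply inverts the titre to give the stock volume corresponding to a chosen dose; the choice of dose remains governed by the parameters discussed above.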

The phrase "therapeutically effective amount" is used herein to mean an 
amount sufficient to reduce by at least about 15 percent, preferably by at least 50
percent, more preferably by at least 90 percent, and most preferably prevent, a
clinically significant deficit in the activity, function and response of the host.
Alternatively, a therapeutically effective amount is sufficient to cause an
improvement in a clinically significant condition in the host.

The composition of the invention may be introduced parenterally or
transmucosally, e.g., orally, nasally, vaginally, or rectally, or transdermally.
Preferably, administration is parenteral, e.g., via intravenous injection, and also
includes, but is not limited to, intra-arteriole, intramuscular, intradermal,
subcutaneous, intraperitoneal, intraventricular, and intracranial administration. The
composition may be administered by injection directly into the
site to be treated, in particular, into the nervous system.

The preferred route of administration to the nervous system is by direct 
injection. The nervous system can be imaged using any of the techniques available 
in the art, such as magnetic resonance imaging or computer-assisted tomography, 
and the therapeutic composition administered by stereotactic injection, for
example.

In yet another embodiment, a composition comprising a human ngn3 
polypeptide, or nucleic acid encoding the polypeptide, can be delivered in a 
controlled release system. For example, the nucleic acid or polypeptide may be
administered using intravenous infusion, an implantable osmotic pump, a
transdermal patch, liposomes, or other modes of administration. In one
embodiment, a pump may be used [see Langer, supra; Sefton, CRC Crit. Ref.
Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 88:507 (1980); Saudek et
al., N. Engl. J. Med. 321:574 (1989)]. In another embodiment, polymeric materials
can be used [see Medical Applications of Controlled Release, Langer and Wise
(eds.), CRC Press: Boca Raton, Florida (1974); Controlled Drug Bioavailability,
Drug Product Design and Performance, Smolen and Ball (eds.), Wiley: New York
(1984); Langer and Peppas, J. Macromol. Sci. Rev. Macromol. Chem. 23:61
(1983); see also Levy et al., Science 228:190 (1985); During et al., Ann. Neurol.
25:351 (1989); Howard et al., J. Neurosurg. 71:105 (1989)]. In yet another
embodiment, a controlled release system can be placed in proximity of the
therapeutic target, i.e., the heart, thus requiring only a fraction of the systemic dose
[see, e.g., Goodson, in Medical Applications of Controlled Release, supra, vol. 2,
pp. 115-138 (1984)]. Other controlled release systems are discussed in the review
by Langer [Science 249:1527-1533 (1990)].

Thus, the compositions of the invention can be delivered by intravenous,
intraarterial, intraperitoneal, intramuscular, or subcutaneous routes of
administration. Alternatively, the compositions, properly formulated, can be
administered by nasal or oral administration. A constant supply of the biological
material can be ensured by providing a therapeutically effective dose (i.e., a dose
effective to induce metabolic changes in a subject) at the necessary intervals, e.g.,
daily, every 12 hours, etc. These parameters will depend on the severity of the
disease condition being treated, other actions, such as diet modification, that are
implemented, the weight, age, and sex of the subject, and other criteria, which can
be readily determined according to standard good medical practice by those of skill
in the art.

An organism to whom a biological material within the
scope of the invention is administered is preferably a human, but can be any
animal. Thus, as can be readily appreciated by one of ordinary skill in the art, the
methods and pharmaceutical compositions of the present invention are particularly
suited to administration to any animal, particularly a mammal, and including, but
by no means limited to, domestic animals, such as feline or canine subjects, farm
animals, such as but not limited to bovine, equine, caprine, ovine, and porcine
subjects, wild animals (whether in the wild or in a zoological garden), research
animals, such as mice, rats, rabbits, goats, sheep, pigs, dogs, cats, etc., avian
species, such as chickens, turkeys, songbirds, etc., i.e., for veterinary medical use.

The invention relates to the use of a polypeptide, nucleic acid, recombinant
vector, recombinant virus, recombinant host cell, or implant according to the 
invention for the manufacture or preparation of a medicament or pharmaceutical 
composition intended for the prevention or treatment of a patient or subject 
suffering from a pathology, dysfunction, disorder, or condition affecting the
nervous system. 

The invention relates to the use of a polypeptide according to the invention, 
preferably a polypeptide having an amino acid sequence of either one of SEQ ID
NOs: 8 or 10 for the manufacture or preparation of a medicament or
pharmaceutical composition intended for the prevention or treatment of a patient or
subject suffering from a pathology, dysfunction, disorder, or condition affecting the
nervous system. 

The invention also relates to the use of a nucleic acid according to the 
invention, preferably a nucleic acid encoding a polypeptide having an amino acid
sequence of either one of SEQ ID NOs: 8 or 10 for the manufacture or preparation
of a medicament or pharmaceutical composition intended for the prevention or
treatment of a patient or subject suffering from a pathology, dysfunction, disorder,
or condition affecting the nervous system. 

The invention also relates to the use of a recombinant vector according to
the invention, preferably a recombinant vector comprising a nucleic acid encoding
a polypeptide having an amino acid sequence of either one of SEQ ID NOs: 8 or
10, for the manufacture or preparation of a medicament or pharmaceutical
composition intended for the prevention or treatment of a patient or subject
suffering from a pathology, dysfunction, disorder, or condition affecting the
nervous system.

The invention also relates to the use of a defective recombinant virus 
according to the invention, preferably a defective recombinant virus comprising a 
nucleic acid encoding a polypeptide having an amino acid sequence of either one of 
SEQ ID NOs: 8 or 10, for the manufacture or preparation of a medicament or
pharmaceutical composition intended for the prevention or treatment of a patient or
subject suffering from a pathology, dysfunction, disorder, or condition affecting the
nervous system. Thus, the present invention also relates to a pharmaceutical 
composition comprising one or more defective recombinant viruses according to 
the invention. 

The invention also relates to the use of a recombinant host cell according to
the invention, preferably a recombinant host cell comprising a nucleic acid
encoding a polypeptide having an amino acid sequence of either one of SEQ ID
NOs: 8 or 10, for the manufacture or preparation of a medicament or
pharmaceutical composition intended for the prevention or treatment of a patient or
subject suffering from a pathology, dysfunction, disorder, or condition affecting the
nervous system. 

The invention also relates to the use of an implant according to the 
invention, preferably an implant comprising a recombinant host cell comprising a 
nucleic acid encoding a polypeptide having an amino acid sequence of either one of
SEQ ID NOs: 8 or 10, for the manufacture or preparation of a medicament or
pharmaceutical composition intended for the prevention or treatment of a patient or
subject suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The present invention also relates to the use of cells genetically modified
ex vivo with a virus according to the invention, or of cells producing such
viruses, implanted in the body, allowing a prolonged and effective expression
in vivo of a biologically active polypeptide according to the invention, preferably a
polypeptide having an amino acid sequence of either one of SEQ ID NOs: 8 or 10.

The invention also relates to the use of a polypeptide according to the
invention, preferably a polypeptide having an amino acid sequence of either one of
SEQ ID NOs: 8 or 10, to control, influence, and/or participate in the expression of
a gene. 

The invention also relates to the use of a nucleic acid according to the 
invention, preferably a nucleic acid comprising a polynucleotide sequence of either
one of SEQ ID NOs: 1 or 9, for the manufacture or preparation of a medicament or
pharmaceutical composition intended for the prevention or treatment of a patient or
subject suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The invention also relates to the use of a recombinant vector according to 
the invention, preferably comprising a nucleic acid comprising a polynucleotide
sequence of either one of SEQ ID NOs: 1 or 9, for the manufacture or preparation
of a medicament or pharmaceutical composition intended for the prevention or
treatment of a patient or subject suffering from a pathology, dysfunction, disorder,
or condition affecting the nervous system. 


Nucleotide Probes and Primers of the Invention 
Nucleotide probes and primers hybridizing with a nucleic acid (genomic 
DNA, messenger RNA, cDNA) according to the invention also form part of the 
invention. 

The definition of a nucleotide probe or primer according to the invention
therefore covers oligonucleotides that hybridize, under the high stringency
hybridization conditions defined above, with a polynucleotide sequence of a 
nucleic acid according to the invention, or a complementary polynucleotide 
sequence. 

According to the invention, nucleic acid fragments derived from a
polynucleotide according to the invention are useful for the detection of the
presence of at least one copy of a bHLH nucleic acid according to the invention, or 
of a fragment or of a variant (containing a mutation or a polymorphism) thereof in 
a sample. 

According to the invention, nucleic acid fragments derived from a nucleic
acid comprising a polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or
of a complementary polynucleotide sequence, are useful for the detection of the
presence of at least one copy of a nucleotide sequence of a bHLH nucleic acid
according to the invention, or of a fragment or of a variant (containing a mutation
or a polymorphism) thereof in a sample. Nucleotide probes and primers
hybridizing with a nucleic acid sequence of a nucleic acid (genomic DNA,
messenger RNA, cDNA) that encodes a polypeptide comprising an amino acid
sequence comprising SEQ ID NO: 8 or SEQ ID NO: 10, also form part of the
invention. 

The nucleotide probes or primers according to the invention comprise at
least 8 consecutive nucleotides of a nucleic acid comprising a polynucleotide
sequence of a) SEQ ID NO: 1, or of a complementary polynucleotide sequence, 
or b) SEQ ID NO: 9, or of a complementary polynucleotide sequence. 

Preferably, nucleotide probes or primers according to the invention will 
have a length of at least 10, 12, 15, 18, 20 to 25, 35, 40, 50, 70, 80, 100, 200,
500, 1000, or 1500 consecutive nucleotides of a nucleic acid according to the 
invention, in particular of a nucleic acid comprising a polynucleotide sequence 
of a) SEQ ID NO: 1, or of a complementary polynucleotide sequence, or b) SEQ 
ID NO: 9, or of a complementary polynucleotide sequence. 
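
The length criterion set out above (a stretch of consecutive nucleotides shared with a target sequence or with its complementary sequence) can be illustrated by the following minimal Python sketch. The short target string is a hypothetical placeholder and is not any of the sequences of the sequence listing; the function names are likewise assumptions made for the purpose of the example, and only exact matches are considered.

```python
# Translation table for Watson-Crick complements (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

# Hypothetical 20-nucleotide target used only for illustration.
TOY_TARGET = "ATGCGTACCTGAAGCTTGGA"


def reverse_complement(seq: str) -> str:
    """Return the complementary strand read in the 5' to 3' direction."""
    return seq.translate(_COMPLEMENT)[::-1]


def longest_shared_run(oligo: str, target: str) -> int:
    """Length of the longest run of consecutive nucleotides common to both strings."""
    oligo, target = oligo.upper(), target.upper()
    best = 0
    prev = [0] * (len(target) + 1)
    for a in oligo:
        cur = [0] * (len(target) + 1)
        for j, b in enumerate(target, start=1):
            if a == b:
                cur[j] = prev[j - 1] + 1  # extend the run ending at the previous pair
                best = max(best, cur[j])
        prev = cur
    return best


def meets_minimum(oligo: str, target: str, minimum: int = 8) -> bool:
    """True if oligo shares at least `minimum` consecutive nucleotides with
    the target or with its complementary sequence."""
    on_target = longest_shared_run(oligo, target)
    on_complement = longest_shared_run(oligo, reverse_complement(target))
    return max(on_target, on_complement) >= minimum


if __name__ == "__main__":
    for oligo in ("GTACCTGA", "TCAGGTAC", "GTACCTG"):
        print(oligo, meets_minimum(oligo, TOY_TARGET))
```

The default threshold of 8 mirrors the minimum stated above; passing a larger `minimum` (for example 15 or 20) tests the preferred lengths in the same way.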

Alternatively, a nucleotide probe or primer according to the invention will
consist of and/or comprise a fragment having a length of 8, 10, 12, 15, 18, 20, 25,
35, 40, 50, 100, 200, 500, 1000, or 1500 consecutive nucleotides of a nucleic acid
according to the invention, more particularly of a nucleic acid comprising a
polynucleotide sequence of a) SEQ ID NO: 1, or of a complementary
polynucleotide sequence, or b) SEQ ID NO: 9, or of a complementary
polynucleotide sequence. 

The preferred probes and primers according to the invention comprise all 
or part of a polynucleotide sequence comprising any one of SEQ ID NOs: 1, 2, 
3, 4, 5, 9, 11, 12, 14, 15, 16, 17, 18, 19, 21, 23, 24, or 25, or of a complementary
polynucleotide sequence.

The nucleotide primers according to the invention may be used to amplify 
any one of the nucleic acids according to the invention, and more particularly a 
nucleic acid comprising a polynucleotide sequence of either one of SEQ ID NOs: 1
or 9, or of a complementary polynucleotide sequence.

Alternatively, the nucleotide primers according to the invention may be
used to amplify a nucleic acid fragment or variant of a nucleic acid comprising a
polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or of a
complementary polynucleotide sequence.

A nucleotide primer or probe according to the invention may be prepared
by any suitable method well known to persons skilled in the art, including by
cloning and action of restriction enzymes or by direct chemical synthesis
according to techniques such as the phosphodiester method by Narang et al.
(Methods Enzymol., 68:90-98, 1979) or by Brown et al. (Methods Enzymol.,
68:109-151, 1979), the diethylphosphoramidite method by Beaucage et al.
(Tetrahedron Lett., 22:1859-1862, 1981) or the technique on a solid support
described in European patent No. EP 0 707 592.

Each of the nucleic acids according to the invention, including the 
oligonucleotide probes and primers described above, may be labeled, if desired, 
by incorporating a marker which can be detected by spectroscopic,
photochemical, biochemical, immunochemical or chemical means. For
example, such markers may consist of radioactive isotopes (32P, 33P, 3H, 35S),
fluorescent molecules (5-bromodeoxyuridine, fluorescein, acetylaminofluorene,
digoxigenin) or ligands such as biotin. The labeling of the probes is preferably
carried out by incorporating labeled molecules into the polynucleotides by
primer extension, or alternatively by addition to the 5' or 3' ends. Examples of
nonradioactive labeling of nucleic acid fragments are described in particular in
French patent No. 78 109 75 or in the articles by Urdea et al. (Nucleic Acids
Research, 11:4937-4957, 1988) or Sanchez-Pescador et al. (J. Clin. Microbiol.,
26(10):1934-1938, 1988).

Preferably, the nucleotide probes and primers according to the invention 
may have structural characteristics of the type to allow amplification of the signal, 
such as the probes described by Urdea et al. (Nucleic Acids Symp. Ser., 24:197-200,
1991) or alternatively in European patent No. EP 0 225 807 (CHIRON).

The oligonucleotide probes according to the invention may be used in
particular in Southern-type hybridizations with the genomic DNA or alternatively
in hybridizations with the corresponding messenger RNA when the expression of 
the corresponding transcript is sought in a sample. 

The probes and primers according to the invention may also be used for the 
detection of products of PCR amplification or alternatively for the detection of
mismatches. 

Nucleotide probes or primers according to the invention may be 
immobilized on a solid support. Such solid supports are well known to persons 
skilled in the art and comprise surfaces of wells of microtiter plates, polystyrene
beads, magnetic beads, nitrocellulose bands or microparticles such as latex particles.

Methods for Amplifying and Detecting Nucleic Acids Encoding Polypeptides of the Invention

The invention also relates to means for the amplification and detection of 
nucleic acids according to the invention, preferably a nucleic acid encoding a 
polypeptide comprising an amino acid sequence of either one of SEQ ID NO: 8 
or SEQ ID NO: 10. 

A preferred embodiment of the present invention relates to a method of 
amplifying a nucleic acid according to the invention, and more particularly a 
nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence; or a nucleic acid fragment or variant
thereof contained in a sample, wherein said method comprises the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is 
suspected into contact with a pair of nucleotide primers whose hybridization 
position is located respectively on the 5' side and on the 3' side of the region of the
target nucleic acid whose amplification is sought, in the presence of the reagents 
necessary for the amplification reaction; and 

b) detecting the amplified nucleic acids. 
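
Steps a) and b) above can be modelled in silico by the following minimal Python sketch, which locates a forward primer on one strand and the binding site of a reverse primer further along the same strand, and returns the region bounded by the two primers. The sketch assumes exact matching only (no mismatches, no melting-temperature considerations); the template and primer strings are hypothetical placeholders and do not correspond to any sequence of the sequence listing.

```python
from typing import Optional

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Hypothetical template and primer pair, for illustration only.
TOY_TEMPLATE = "TTGACCATGGCTAGCTAGGATCCGATCGTACGAATTCAA"
TOY_FORWARD = "ACCATGG"    # hybridizes on the 5' side of the region
TOY_REVERSE = "GAATTCGTA"  # hybridizes on the 3' side (opposite strand)


def reverse_complement(seq: str) -> str:
    """Return the complementary strand read in the 5' to 3' direction."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def predict_amplicon(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the region delimited by the primer pair, or None if either
    primer has no exact binding site in the expected orientation."""
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so on the template
    # strand its site reads as the reverse complement of the primer.
    site = reverse_complement(reverse)
    site_start = template.find(site, start + len(forward))
    if site_start == -1:
        return None
    return template[start:site_start + len(site)]


if __name__ == "__main__":
    product = predict_amplicon(TOY_TEMPLATE, TOY_FORWARD, TOY_REVERSE)
    if product is None:
        print("no amplification predicted")
    else:
        print(len(product), product)
```

A return value of `None` corresponds to the case in which the target nucleic acid is not detected in the sample under this simplified exact-match model.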

The present invention also relates to a method of detecting the presence of a 
nucleic acid in a sample, wherein the nucleic acid comprises a polynucleotide 
sequence of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence;
or a nucleic acid fragment or variant thereof, said method comprising the steps of: 

1) bringing one or more nucleotide probes according to the invention into
contact with the sample to be tested; 

2) detecting the complex which may have formed between the probe(s) and 
the nucleic acid present in the sample. 

According to a specific embodiment of the method of detection according 
to the invention, the oligonucleotide probes and primers are immobilized on a 
support. 

According to another aspect, the oligonucleotide probes and primers 
comprise a detectable marker. 

The invention relates, in addition, to a box or kit for detecting the presence 
of a nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probe(s) or primer(s) as described above; 

b) where appropriate, the reagents necessary for the hybridization reaction.

According to a first aspect, the detection box or kit is characterized in that
the probe(s) or primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in 
that the oligonucleotide probes comprise a detectable marker. 

According to a specific embodiment of the detection kit described above, 
such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention which may be used to detect a target nucleic acid of 
interest or alternatively to detect mutations in the coding regions or the non-coding
regions of the nucleic acids according to the invention. 

Another subject of the invention is a box or kit for amplifying all or part of 
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of
a complementary polynucleotide sequence, wherein said box or kit comprises: 

1) a pair of nucleotide primers in accordance with the invention, whose
hybridization position is located respectively on the 5' side and 3' side of the 
target nucleic acid whose amplification is sought; and optionally, 

2) reagents necessary for an amplification reaction.

Such an amplification box or kit will preferably comprise at least one pair 
of nucleotide primers as described above. 

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention; 

b) where appropriate, reagents necessary for a hybridization reaction.

According to a first aspect, the detection box or kit is characterized in that
the nucleotide probe(s) and primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in 
that the nucleotide probe(s) and primer(s) comprise a detectable marker. 

According to a specific embodiment of the detection kit described above, 
such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention that may be used to detect target nucleic acids of 
interest. 

According to a preferred embodiment of the invention, the target nucleic acid
comprises a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a
complementary polynucleotide sequence. 

Alternatively, the target nucleic acid is a nucleic acid fragment or variant of
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of 
a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 4 and 13, making it possible to 
amplify the region of nucleotides 750-1460 of SEQ ID NO: 1, or a nucleic acid
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 5 and 13, making it possible to
amplify the region of nucleotides 777-1460 of SEQ ID NO: 1, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 5 and 13, making it possible to
amplify the region of nucleotides 640-1330 of SEQ ID NO: 9, or a nucleic acid
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 5 and 25, making it possible to
amplify the region of nucleotides 640-1330 of SEQ ID NO: 9, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the
invention comprise all or part of SEQ ID NOs: 24 and 25, making it possible to
amplify the region of nucleotides 1-1330 of SEQ ID NO: 9, or a nucleic acid
having a complementary polynucleotide sequence. 
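
For orientation, the sizes of the regions cited in the preceding embodiments can be computed as follows. This short Python sketch assumes that the nucleotide positions are 1-based and inclusive, which is the usual convention for sequence listings but is an assumption here; the helper names are hypothetical.

```python
def region_length(first: int, last: int) -> int:
    """Number of nucleotides in a region given 1-based inclusive positions."""
    if first < 1 or last < first:
        raise ValueError("expected 1 <= first <= last")
    return last - first + 1


def extract_region(sequence: str, first: int, last: int) -> str:
    """Return the sub-sequence spanning positions first..last (1-based, inclusive)."""
    if last > len(sequence):
        raise ValueError("region extends beyond the end of the sequence")
    region_length(first, last)  # validates the coordinates
    return sequence[first - 1:last]


# Regions named in the embodiments above, as (sequence identifier, first, last).
CITED_REGIONS = [
    ("SEQ ID NO: 1", 750, 1460),
    ("SEQ ID NO: 1", 777, 1460),
    ("SEQ ID NO: 9", 640, 1330),
    ("SEQ ID NO: 9", 1, 1330),
]

if __name__ == "__main__":
    for name, first, last in CITED_REGIONS:
        print(f"{name}, nucleotides {first}-{last}: {region_length(first, last)} nt")
```

Under the stated convention, `extract_region` applied to the full text of a sequence would return the region itself, the Python slice `sequence[first - 1:last]` being the zero-based equivalent of the cited positions.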

According to another preferred embodiment, a primer according to the 
invention comprises, generally, all or part of any one of SEQ ID NOs: 1, 2, 3, 4, 5, 
9, 11, 12, 14, 15, 16, 17, 18, 19, 21, 23, 24, or 25, or a complementary sequence.

Thus, the probes according to the invention, immobilized on a support, may
be ordered into matrices such as "DNA chips". Such ordered matrices have in
particular been described in US patent No. 5,143,854, in published PCT
applications WO 90/15070 and WO 92/10092.

Support matrices on which oligonucleotide probes have been immobilized 
at a high density are for example described in US patent No. 5,412,087 and in
published PCT application WO 95/11995.

The nucleotide primers according to the invention may be used to amplify 
any one of the nucleic acids according to the invention, or a complementary 
polynucleotide sequence. Alternatively, the nucleotide primers according to the
invention may be used to amplify a nucleic acid fragment or variant of a nucleic
acid according to the invention, or a complementary polynucleotide sequence.

Another subject of the invention relates to a method of amplifying a nucleic 
acid according to the invention, or a complementary polynucleotide sequence, 
contained in a sample, said method comprising the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is
suspected into contact with a pair of nucleotide primers whose hybridization
position is located respectively on the 5' side and on the 3' side of the region of the
target nucleic acid whose amplification is sought, in the presence of the reagents
necessary for the amplification reaction; and

b) detecting the amplified nucleic acids.

To carry out the amplification method as defined above, use will be 
preferably made of any of the nucleotide primers described above. 

The subject of the invention is, in addition, a box or kit for amplifying all or 
part of a nucleic acid according to the invention, or a complementary 
polynucleotide sequence, said box or kit comprising:

a) a pair of nucleotide primers in accordance with the invention, whose 
hybridization position is located respectively on the 5' side and 3' side of the 
target nucleic acid whose amplification is sought; and optionally, 

b) reagents necessary for the amplification reaction. 

Such an amplification box or kit will preferably comprise at least one pair
of nucleotide primers as described above.

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention;

b) where appropriate, reagents necessary for a hybridization reaction.

According to a first aspect, the detection box or kit is characterized in that 
the nucleotide probe(s) and primer(s) are immobilized on a support.

According to a second aspect, the detection box or kit is characterized in 
that the nucleotide probe(s) and primer(s) comprise a detectable marker. 

According to a specific embodiment of the detection kit described above, 
such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention which may be used to detect target nucleic acids of 
interest or alternatively to detect mutations in the coding regions or the non-coding 
regions of the nucleic acids according to the invention. 

According to a preferred embodiment of the invention, the target nucleic acid 
comprises a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence. 

Alternatively, the target nucleic acid is a nucleic acid fragment or variant of 
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of 
a complementary polynucleotide sequence. 

According to a preferred embodiment, a primer according to the invention 
comprises, generally, all or part of any one of SEQ ID NOs: 1, 2, 3, 4, 5, 9, 11, 12, 
14, 15, 16, 17, 18, 19, 21, 23, 24, or 25, or a complementary sequence. 

Screening Assays 

Identification and isolation of a gene encoding a polypeptide of the 
invention provides for expression of the polypeptide in quantities greater than can 
be isolated from natural sources, or in indicator cells that are specially engineered 
to indicate the activity of a polypeptide of the invention expressed after transfection 
or transformation of the cells. Accordingly, in addition to rational design of 
agonists and antagonists based on the structure of a polypeptide of the invention, 
the present invention contemplates an alternative method for identifying specific 
ligands of the polypeptide using various screening assays known in the art. 

Basic helix-loop-helix proteins play a role in cell differentiation and/or cell 
fate determination by regulating or inducing transcription of key genes. Therefore, 
agonists and antagonists of a polypeptide of the present invention will be expected 
to improve or decrease its transcription factor activity. 

The invention provides methods for screening small molecules and 
compounds that act on a basic helix-loop-helix protein to identify agonists and 
antagonists of Relax or hngn3 polypeptides that can improve, reduce, or inhibit 
neurogenesis, or a function of the nervous system, or a function of the endocrine 
system, in particular the pancreas, from a therapeutic point of view. For example, 
hngn3 may be implicated in the transforming process that leads to some 
neuroectodermal and/or pancreatic tumors. In these cases, inhibition of bHLH 
activity could be of great therapeutic value. Furthermore, activation of hngn3 may 
be useful to generate in vitro or in vivo specific neuronal cells and endocrine 
pancreatic cells. It has recently been reported that the overexpression of ngn3 in 
the developing pancreas yields an increased number of insulin-producing cells. 
Thus, the activation of hngn3 polypeptides may be useful in the treatment of type 1 
diabetes mellitus. In general, these methods are useful to identify small molecules 
and compounds for therapeutic use in the treatment of a disease, condition, 
disorder, pathology, or dysfunction of the nervous system, or the endocrine system, 
specifically, the pancreatic organ. 

Thus, the present invention relates to a method of screening for antagonists 
or agonists of the polypeptides according to the invention. Preferably, the method 
of screening for antagonists or agonists of the polypeptides according to the 
invention is directed toward identifying an antagonist or agonist of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NO: 8 or SEQ ID NO: 
10. 

Therefore, the invention also relates to the use of a polypeptide according 
to the invention or a cell expressing a polypeptide according to the invention, for 
screening active ingredients for the prevention or treatment of a disease, 
condition, disorder, pathology, or dysfunction of the nervous system. An active 
site and oligopeptide or immunogenic fragments of a polypeptide according to 
the invention can serve for screening product libraries by a whole range of 
existing techniques. The polypeptide fragment used in this type of screening may 
be free in solution, bound to a solid support, at the cell surface or in the cell. The 
formation of the binding complexes between the polypeptide fragments and the 
tested agent can then be measured. 

Another product screening technique that may be used in high-throughput 
screenings giving access to products having affinity for the protein of interest is 
described in application WO84/03564. In this method, applied to a polypeptide 
according to the invention, various products are synthesized on a solid surface. 
These products react with the polypeptide or fragment thereof and the complex is 
washed. The products binding the polypeptide are then detected by methods known 
to persons skilled in the art. Non-neutralizing antibodies can also be used to 
capture a peptide and immobilize it on a support. 

Another possibility is to perform a product screening method using 
competition between a neutralizing antibody, a polypeptide according to the 
invention, and a product potentially binding the polypeptide. In this manner, the 
antibodies may be used to detect the presence of a peptide having a common 
antigenic unit with the polypeptide or protein according to the invention. 

Accordingly, this invention relates to the use of any method of screening 
products, i.e., compounds, small molecules, and the like, this being in all synthetic 
or cellular types, that is to say of mammals, insects, bacteria, or yeasts expressing 
constitutively or having incorporated a rat Relax or human ngn3 encoding nucleic 
acid. 

The present invention also relates to the use of such a system for screening 
molecules that modulate the activity of a polypeptide according to the invention. 

Thus, the invention relates to methods of screening and identifying a modulator, 
agonist, or antagonist of a polypeptide according to the invention in a sample. 

Any screening technique known in the art can be used to screen for agonists 
or antagonists of the polypeptides of the invention. For example, a suitable isolated 
cell or cell line that expresses a polypeptide according to the invention, preferably 
human ngn3, can be transfected with a nucleic acid encoding a marker gene, such 
as β-galactosidase (β-gal). The isolated cell or cell line may express the 
polypeptide of the invention either naturally or after transfection of a nucleic acid 
encoding the polypeptide. The cell or cell line is then exposed to a test solution or 
sample comprising a potential agonist or antagonist, and then stained for 
β-galactosidase activity. The presence of more β-gal positive cells relative to control 
cells not exposed to the test solution or sample is an indication of the presence of 
an agonist in the test solution or sample. Conversely, the presence of fewer β-gal 
positive cells relative to control cells not exposed to the test solution or sample is 
an indication of the presence of an antagonist in the test solution or sample. 

The present invention relates to methods of identifying a modulator, 
agonist, or antagonist of a polypeptide according to the invention in a sample 
comprising 

a) obtaining a cell, for example a cell line, that, either naturally or after 
transfecting the cell with a nucleic acid encoding a polypeptide according to the 
invention, expresses a polypeptide according to the invention, 

b) transfecting the cell with a nucleic acid encoding a marker gene, such as 
β-galactosidase (β-gal), 

c) incubating the cell of step b) with a test solution or sample comprising a 
potential modulator, agonist or antagonist, 

d) measuring the amount of β-galactosidase activity, and 

e) comparing the amount of β-galactosidase activity measured in step d) 
with an amount of β-galactosidase activity measured with a cell that has not been 
previously incubated in the presence of the candidate modulator, agonist, or 
antagonist compound for the polypeptide according to the invention. 
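
By way of illustration only, the comparison of step e) can be sketched as follows. The function name and the 20% tolerance band are assumptions introduced for the example; the description does not specify a threshold.

```python
def classify_compound(treated: float, control: float, tolerance: float = 0.2) -> str:
    """Compare reporter activity with and without the candidate compound.

    `treated` and `control` are reporter activities (for example
    beta-galactosidase activity) in the same arbitrary units.
    `tolerance` is an assumed band around the control value inside which
    no effect is called; it is not taken from the description.
    """
    if control <= 0:
        raise ValueError("control activity must be positive")
    ratio = treated / control
    if ratio > 1 + tolerance:
        return "agonist"
    if ratio < 1 - tolerance:
        return "antagonist"
    return "no effect"


print(classify_compound(treated=150.0, control=100.0))
```

In practice the tolerance would be set from the variability of replicate control measurements rather than fixed in advance.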

Another method of the present invention for identifying an agonist or 
antagonist of the ngn3 polypeptide is based on Applicants' discovery that ngn3 
polypeptide specifically activates its own promoter as shown in the spinal cord and 
pancreas. Applicants have recognized that the expression of ngn3 is strictly 
dependent on the ngn3 protein. To date, no other members of the bHLH family 
have been found to activate or modulate the endogenous ngn3 promoter. 

According to the novel method in the present invention, the ngn3 responsive 
element located within the ngn3 promoter may be placed upstream of a reporter 
gene, for example, β-galactosidase, firefly luciferase, or green fluorescent protein, 
and be used as a specific marker to assay the level of activity of the ngn3 
polypeptide in the presence of an agonist or antagonist in a cell that expresses a 
known amount of ngn3 protein. The compound that modulates the endogenous 
activity of the ngn3 polypeptides will modify the expression of the reporter gene. 
The cell may be a primary culture cell or a cell line that, either naturally or after 
transfection with a nucleic acid encoding a ngn3 polypeptide according to the 
invention, expresses the polypeptide according to the invention. 

In a specific embodiment, the polypeptide comprises an amino acid 
sequence of SEQ ID NO: 8 or SEQ ID NO: 10. 

According to a first aspect of the above screening methods, the cells used 
are cells naturally expressing a polypeptide according to the invention. Preferably, 
the cell expresses a polypeptide comprising an amino acid sequence of 
SEQ ID NO: 8 or SEQ ID NO: 10. The cells may be primary culture cells obtained 
from a population of embryonic pancreatic epithelium and/or ventral spinal cord, 
for example. 

According to a second aspect, the cells used in the screening methods 
described above may be cells not naturally expressing, or alternatively expressing 
at a low level, a polypeptide according to the invention, said cells being transfected 
with a recombinant vector according to the invention capable of directing the 
expression of a nucleic acid encoding the polypeptide. Preferably, the cell 
expresses a polypeptide comprising an amino acid sequence of SEQ ID NO: 8 or 
SEQ ID NO: 10. 

According to a third aspect of the above screening methods, the polypeptide 
according to the invention may be a recombinantly produced polypeptide. 
Preferably, the recombinantly produced polypeptide comprises an amino acid 
sequence of SEQ ID NO: 8 or SEQ ID NO: 10. 

The present invention contemplates screens for small molecule ligands or 
ligand analogs and mimics, as well as screens for natural ligands that bind to and 
agonize or antagonize a polypeptide of the invention in vivo. For example, natural 
product libraries can be screened using assays of the invention for molecules that 
agonize or antagonize the polypeptide's activity. 

Knowledge of the primary sequences of Relax and hngn3, and the similarity 
of these sequences with proteins of known function, can provide an initial clue as 
to the inhibitors or antagonists of the polypeptides of the invention. Identification 
and screening of antagonists is further facilitated by determining structural features 
of the protein, e.g., using X-ray crystallography, neutron diffraction, nuclear 
magnetic resonance spectrometry, and other techniques for structure determination. 
These techniques provide for the rational design or identification of agonists and 
antagonists. 

Another approach uses recombinant bacteriophage to produce large 
libraries. Using the "phage method" [Scott and Smith, Science 249:386-390 
(1990); Cwirla et al., Proc. Natl. Acad. Sci., 87:6378-6382 (1990); Devlin et al., 
Science, 249:404-406 (1990)], very large libraries can be constructed (10^-10^ 
chemical entities). A second approach uses primarily chemical methods, of which 
the Geysen method [Geysen et al., Molecular Immunology 23:709-715 (1986); 
Geysen et al., J. Immunologic Method 102:259-274 (1987)] and the method of 
Fodor et al. [Science 251:767-773 (1991)] are examples. Furka et al. [14th 
International Congress of Biochemistry, Volume 5, Abstract FR:013 (1988); Furka, 
Int. J. Peptide Protein Res. 37:487-493 (1991)], Houghton [U.S. Patent No. 
4,631,211, issued December 1986] and Rutter et al. [U.S. Patent No. 5,010,175, 
issued April 23, 1991] describe methods to produce a mixture of peptides that can 
be tested as agonists or antagonists. 

In another aspect, synthetic libraries [Needels et al., Proc. Natl. Acad. Sci. 
USA 90:10700-4 (1993); Ohlmeyer et al., Proc. Natl. Acad. Sci. USA 90:10922- 
10926 (1993); Lam et al., International Patent Publication No. WO 92/00252; 
Kocis et al., International Patent Publication No. WO 9428028, each of which is 
incorporated herein by reference in its entirety], and the like can be used to screen 
for ligands of the polypeptides according to the present invention. 

The screening can be performed with recombinant cells that express a 
polypeptide of the invention, or alternatively, using purified protein, e.g., produced 
recombinantly, as described above. For example, labeled, soluble Relax or hngn3 
can be used to screen libraries, as described in the foregoing references. 

In one embodiment, a polypeptide of the invention may be directly labeled. 
In another embodiment, a labeled secondary reagent may be used to detect binding 
of the polypeptide of the invention to a molecule of interest, e.g., a molecule 
attached to a solid phase support. Binding may be detected by in situ formation of 
a chromophore by an enzyme label. Suitable enzymes include, but are not limited 
to, alkaline phosphatase and horseradish peroxidase. In a further embodiment, a 
two-color assay, using two chromogenic substrates with two enzyme labels on 
different acceptor molecules of interest, may be used. Cross-reactive and 
singly-reactive ligands may be identified with a two-color assay. 

Other labels for use in the invention include colored latex beads, magnetic 
beads, fluorescent labels (e.g., fluorescein isothiocyanate (FITC), phycoerythrin 
(PE), Texas red (TR), rhodamine, free or chelated lanthanide series salts, especially 
Eu3+, to name a few fluorophores), chemiluminescent molecules, radio-isotopes, or 
magnetic resonance imaging labels. Two-color assays may be performed with two 
or more colored latex beads, or fluorophores that emit at different wavelengths. 
Labels may be detected visually or by mechanical/optical means. 
Mechanical/optical means include fluorescence activated sorting, i.e., analogous to 
FACS, and micromanipulator removal means. 

As exemplified herein, the level of a polypeptide of the invention can be 
evaluated by metabolic labeling of the proteins. As the metabolic labeling occurs 
during in vitro incubation of the tissue biopsy in the presence of culture medium 
supplemented with [35S]-methionine, the level of each of the markers detected may 
be affected by the in vitro conditions. In addition to metabolic (or biosynthetic) 
labeling with [35S]-methionine, the invention further contemplates labeling with 
[14C]-amino acids and [3H]-amino acids (with the tritium substituted at non-labile 
positions). Thus, a sample or library of compounds can be directly analyzed after 
labeling of the proteins therein, e.g., by colorimetric staining using silver, gold, 
coomassie blue, or amido-schwartz, to mention a few techniques; isotopic labeling, 
e.g., with [32P]-orthophosphate, [125I], [131I]; fluorescent or chemiluminescent tags; 
and immunological detection with labeled antibody or specific binding partner of a 
marker. 



The present invention may be better understood by reference to the 
following non-limiting Examples, which are provided as exemplary of the 
invention. The examples and figures presented below by way of illustration and 
with no limitation being implied demonstrate other advantages and characteristics 
of the present invention. 



General molecular biology techniques 

The methods traditionally used in molecular biology, such as preparative 
extractions of plasmid DNA, centrifugation of plasmid DNA in a caesium chloride 
gradient, agarose or acrylamide gel electrophoresis, purification of DNA fragments 
by electroelution, protein extraction with phenol or phenol/chloroform, ethanol or 
isopropanol precipitation of DNA in a saline medium, transformation in 
Escherichia coli, and the like, are well known to a person skilled in the art and are 
amply described in the literature [Maniatis T. et al., "Molecular Cloning, a 
Laboratory Manual", Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 
1982; (2nd Ed. 1989); Ausubel F.M. et al. (eds), "Current Protocols in Molecular 
Biology", John Wiley & Sons, New York, 1987]. 

Conventional cloning vehicles include pBR322 and pUC type plasmids and 
phages of the M13 series. These may be obtained commercially (Bethesda 
Research Laboratories). 

For ligation, DNA fragments may be separated according to their size by 
agarose or acrylamide gel electrophoresis, extracted with phenol or with a 
phenol/chloroform mixture, precipitated with ethanol and then incubated in the 
presence of phage T4 DNA ligase (Biolabs) according to the supplier's 
recommendations. 

The filling in of 5' protruding ends may be performed with the Klenow 
fragment of E. coli DNA polymerase I (Biolabs) according to the supplier's 
specifications. The destruction of 3' protruding ends is performed in the presence of 
phage T4 DNA polymerase (Biolabs) used according to the manufacturer's 
recommendations. The destruction of 5' protruding ends is performed by a 
controlled treatment with S1 nuclease. 

Mutagenesis directed in vitro by synthetic oligodeoxynucleotides may be 
performed according to the method developed by Taylor et al. [Nucleic Acids Res. 
13 (1985) 8749-8764] using the kit distributed by Amersham. 

The enzymatic amplification of DNA fragments by the PCR technique 
[Polymerase-catalyzed Chain Reaction, Saiki R.K. et al., Science 230 (1985) 
1350-1354; Mullis K.B. and Faloona F.A., Meth. Enzym. 155 (1987) 335-350] may be 
performed using a "DNA thermal cycler" (Perkin Elmer Cetus) according to the 
manufacturer's specifications. 

Verification of nucleotide sequences may be performed by the method 
developed by Sanger et al. [Proc. Natl. Acad. Sci. USA, 74 (1977) 5463-5467] 
using the kit distributed by Amersham. 

Plasmid DNAs may be purified by the Qiagen Plasmid Purification System 
according to the manufacturer's instructions. 

EXAMPLE 1: Direct amplification by PCR of the sequences between the basic 
domains and Helix II with the aid of degenerate oligonucleotides. 

The isolation of the cDNA encoding new transcription factors of the bHLH 
family is carried out using rat SCG (rat Superior Cervical Ganglia). Given that the 
proteins of this family have highly conserved amino acid units, in particular a basic 
domain followed by Helix I and Helix II, degenerate oligonucleotides encoding 
these conserved regions are chosen. They were selected so as to correspond to 
the conserved basic region and Helix II domain of the Drosophila 
achaete-scute complex and of mammalian bHLH genes such as Mash-1, Mash-2 and 
M-Twist. Such oligonucleotides have been used respectively as sense and anti-sense 
primers, on the one hand to carry out direct screening of an SCG library 
constructed by PCR, and on the other hand to carry out PCRs on SCG 
ss-cDNAs. 

The sense oligonucleotide was chosen from the basic domain of these 
bHLH proteins; it is more precisely the oligonucleotide having the 
following sequence: 5'-AATKHGMGIGAGCGCIDKCGCRYG-3' 
(SEQ ID NO: 2). 

The anti-sense oligonucleotide, for its part, was determined from the Helix 
II domain of the same bHLH proteins. It is the oligonucleotide having the 
following sequence: 5'-GGCSRDTYTCAGGGTSYBGAYCTT-3' 
(SEQ ID NO: 3). 
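
By way of illustration only, the degeneracy of the two oligonucleotides above can be estimated with the standard IUPAC nucleotide codes. The sketch assumes that "I" denotes inosine and counts it as a single synthesized base; that reading, and the function names, are assumptions introduced for the example.

```python
from math import prod

# Standard IUPAC nucleotide codes; "I" (inosine) is treated as one base,
# which is an assumption made for this sketch.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
    "I": "I",
}

SENSE = "AATKHGMGIGAGCGCIDKCGCRYG"      # SEQ ID NO: 2 as printed above
ANTISENSE = "GGCSRDTYTCAGGGTSYBGAYCTT"  # SEQ ID NO: 3 as printed above


def degeneracy(primer: str) -> int:
    """Number of distinct sequences encoded by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer.upper())


def degenerate_positions(primer: str):
    """List (1-based position, code) for every degenerate position."""
    return [(i + 1, base) for i, base in enumerate(primer.upper())
            if len(IUPAC[base]) > 1]


for name, primer in (("sense", SENSE), ("anti-sense", ANTISENSE)):
    print(name, len(primer), degeneracy(primer), degenerate_positions(primer))
```

Under these assumptions the sense primer represents 288 distinct sequences and the anti-sense primer 576, each 24 nucleotides long.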

The single-stranded cDNA (ss-cDNA) is prepared from 500 ng of 
denatured polyA+ RNA derived from the SCG of two-day-old rats. 

The PCR amplification is carried out on a mixture comprising, in addition 
to the PCR buffer (10 mM Tris, pH 8.3, 50 mM KCl, 2.5 mM MgCl2), 200 µM 
dNTP, 12.5 µM of each oligonucleotide and about 20 ng of single-stranded 
(ss)-cDNA. The PCR is carried out by performing successively 3 minutes of 
denaturation at 94°C; 2 cycles [94°C for 30 seconds (sec); 50°C for 45 sec; 72°C for 
1.5 minutes (min)] followed by 38 cycles (94°C for 30 sec; 62°C for 45 sec; 72°C 
for 1.5 min). The PCR products are cloned at the SmaI site of pUC19 and the 1500 
clones thus obtained are analyzed by sequencing. Applicants searched for the presence 
of a Helix I unit in the amino acid sequence deduced from the nucleotide sequence 
of these various cDNAs. A large proportion of these cDNAs (670) contain such a 
unit, and among them, only one corresponds to a new possible member of the 
bHLH family. The other cDNAs isolated correspond to known bHLH proteins. 
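
By way of illustration only, the cycling program described above can be written down as data and its total hold time computed. The class and function names are hypothetical, and ramp times between temperatures are ignored, which is an assumption of the sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float    # block temperature in degrees Celsius
    seconds: float   # hold time at that temperature


@dataclass(frozen=True)
class Stage:
    steps: tuple     # ordered Step objects repeated in each cycle
    cycles: int = 1

    def duration(self) -> float:
        """Total hold time of the stage in seconds (ramps ignored)."""
        return self.cycles * sum(step.seconds for step in self.steps)


# Values taken from the paragraph above.
PROGRAM = (
    Stage((Step(94, 180),)),                                      # initial denaturation
    Stage((Step(94, 30), Step(50, 45), Step(72, 90)), cycles=2),  # low-stringency cycles
    Stage((Step(94, 30), Step(62, 45), Step(72, 90)), cycles=38), # high-stringency cycles
)


def total_minutes(program) -> float:
    """Sum of all hold times, in minutes."""
    return sum(stage.duration() for stage in program) / 60


print(total_minutes(PROGRAM))
```

The two low-temperature cycles allow the degenerate primers to anneal despite mismatches; the remaining 38 cycles run at the higher annealing temperature once the primer sequences have been incorporated into the products.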

The new isolated cDNA, called CIG235 and represented in Figure 1, 
corresponds to a fragment of 76 nucleotides (excluding the sequences 
corresponding to the oligonucleotides introduced by PCR). The alignments of 
amino acid sequences were performed allowing neither insertions nor 
deletions. These alignments show that this fragment encodes a unit homologous to 
the Helix I unit of the bHLH proteins. The isolation of the complete cDNA was 
then undertaken. 



EXAMPLE 2: Isolation of the complete cDNA corresponding to CIG235. 

The isolation of the complete cDNA corresponding to CIG235 was 
extremely difficult because the mRNA corresponding to CIG235 is expressed at a 
very low level in the SCG. It even appears that its expression level is close to that of 
illegitimate transcription. The technique selected consists, on the one hand, in 
isolating the 3' end of the cDNA corresponding to CIG235 by anchored PCR and, 
on the other hand, in isolating the 5' end of this cDNA from the spinal cord of rat 
embryos at the E12.5 stage. 

1. Extension of the cDNA CIG235 in 3' by anchored PCR. 

For that, SCG ss-cDNAs were prepared with the RA3'NV primer (random 
reverse transcription primer carrying the 3' NV anchor) according to the Dumas 
Milne Edwards et al. method (1995, A Practical Approach, Ed McPherson et al., 
pp 89-118, IRL Press, Oxford, U.K.). Two specific nested primers are chosen in the 
bHLH positive clone of 76 nucleotides: 
CIG5'-1: 5'-AACCTTAACTCCGCGCTGGATGCGC-3' (SEQ ID NO: 4) and 
CIG5'-2: 5'-CGCGGTGTCCTGCCCACC-3' (SEQ ID NO: 5). 

Applicants then amplified the 3' end of CIG235 by two nested PCRs with 
the pairs of oligonucleotides CIG5'-1 and A3'-1 and then CIG5'-2 and A3'-2. The 
PCR mixture identified in Example 1 is used with only 0.8 µM of each of the 
oligonucleotides and a denaturation step at 94°C for 30 sec followed by an 
extension step at 72°C for 1.5 min. The "annealing" is carried out for 45 sec with 
variations in temperature. During the first ten cycles, this temperature is reduced by 
1°C every two cycles from 70°C to 65°C and then maintained for 30 
cycles. The resulting products are cloned into pUC19 and sequenced. Applicants 
thus obtained a cDNA of 717 nucleotides, called CIG3', whose 5' part corresponds 
to the 3' end of the clone CIG235. Analysis of the open reading frames shows that 
the entire sequence encoding the carboxy-terminal end of the protein was isolated 
with 321 nucleotides of noncoding 3' region. 
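
By way of illustration only, the stepped annealing temperature described above can be generated as follows. The function name is hypothetical. The hold temperature is passed explicitly because the text does not state it unambiguously; the value of 65°C used in the example is an assumption based on the stated end point of the ramp.

```python
def annealing_schedule(hold_c: float, start_c: float = 70.0, step_c: float = 1.0,
                       cycles_per_step: int = 2, ramp_cycles: int = 10,
                       hold_cycles: int = 30):
    """Return the annealing temperature of each cycle, in order.

    The temperature starts at `start_c` and drops by `step_c` every
    `cycles_per_step` cycles during the first `ramp_cycles` cycles,
    then stays at `hold_c` for `hold_cycles` further cycles.
    """
    ramp = [start_c - step_c * (cycle // cycles_per_step)
            for cycle in range(ramp_cycles)]
    return ramp + [hold_c] * hold_cycles


# hold_c=65.0 is an assumption (the end point of the ramp given in the text).
schedule = annealing_schedule(hold_c=65.0)
print(len(schedule), schedule[:12])
```

Starting above the expected annealing temperature and lowering it stepwise favors the specific product in the early cycles, before less stringent conditions are reached.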



2. Isolation of the 5' end corresponding to CIG3' from the spinal cord of rat 
embryos at the E12.5 stage. 

Applicants sought out by in situ hybridization the pattern of expression of 
the mRNA corresponding to CIG3'. Its expression is estimated at various stages of 
embryonic development. The method chosen is that of in situ hybridization on 
thick rat embryo sections with a digoxigenin-labelled probe. This method is in fact 
an adaptation of that described for in toto hybridization (1994, A Practical 
Approach, Ed Wilkinson et al., pp 75-83, IRL Press, Oxford, U.K.), and makes it 
possible to rapidly screen whole embryos at numerous different stages. 

A high expression of this mRNA was detected in the spinal cord, at 
embryonic day 12.5 (E12.5). Applicants therefore prepared mRNAs from spinal 
cord at this stage of development, and carried out the isolation of the 5' end by the 
SLIC method. A PCR product of 893 nucleotides is isolated and called CIG5'. 

The CIG3' and CIG5' cDNAs overlap over a region of 120 nucleotides and 
form a total cDNA whose size is 1491 nucleotides. 

However, given that CIG3' and CIG5' were not isolated from the same 
tissue, it was necessary to show that the contig generated by the alignment of these 
two clones corresponded to an existing mRNA and not to a chimera generated by 
PCR. Applicants therefore verified by PCR, after reverse transcription, that the 
total mRNA is present in the spinal cord at E12.5. In this tissue, a single PCR 
amplification makes it possible to visualize this mRNA in its entirety. Furthermore, 
analysis of the mRNAs by Northern blotting shows that all three of the probes 
CIG3', CIG5' and total cDNA make it possible to detect a single mRNA of 3.1 kb. 
Finally, by in situ hybridization, the spatiotemporal expression pattern obtained with 
CIG3' or with the total cDNA is absolutely identical. These three results allow us to 
assert that CIG3' and CIG5' indeed correspond to two fragments of the same mRNA. 

In order to eliminate the point mutations introduced by Taq 
polymerase, Applicants carried out, on ss-cDNAs of the spinal cord at E12.5, four 
independent amplifications of the entire cDNA. Applicants were thus able to 
accurately determine the sequence of this cDNA that is represented in SEQ ID NO: 
1. 

Analysis of the cDNA sequence shows that Applicants isolated the entire 
coding region, the noncoding 5' region as well as part of the untranslated 3' region. 
Furthermore, the expression of the corresponding mRNA is detected in the embryo 
in the form of longitudinal bands. This characteristic led to this cDNA being 
designated Relax, for Rat embryonic longitudinal axis. 

EXAMPLE 3: Comparison of amino acid sequences of the bHLH domain of Relax 
with other proteins of the bHLH family. 

Analysis of the amino acid sequence deduced from the Relax cDNA clearly 
indicates that this protein is part of the bHLH family. The bHLH domain of Relax 
shares 68% identity with the bHLH unit of the protein NeuroD (Lee et al., 1995). 
Furthermore, it appears that Relax shares more sequence identities with proteins 
homologous to atonal (Math-1, Akazawa et al., 1995) than with those homologous to 
AS-C (Mash-1, Johnson et al., 1990) (Figure 2). However, the total absence of 
sequence identity, outside the bHLH domain, with the other members of the family 
reveals that Relax is not the paralogue of an already identified protein. For this 
reason, Relax probably corresponds to the first member of a new subfamily of 
bHLH proteins. 
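
By way of illustration only, a percentage of identity between two aligned domains of equal length, compared position by position without insertions or deletions, can be computed as follows. The function name and the toy sequences are hypothetical; they are not the Relax or NeuroD sequences.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical residues between two ungapped sequences.

    Both sequences must have the same length, since neither insertions
    nor deletions are allowed in this comparison.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Toy ten-residue sequences invented for illustration only.
print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```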

EXAMPLE 4: Characterization of the transcriptional activity of the Relax protein. 

The amino acid sequence analyses indicate that Relax shares a common 
structure with proteins of the bHLH family. These proteins are transcription 
factors. It is therefore important to demonstrate that the Relax cDNA encodes a 
protein whose characteristics are those of a transcription factor, that is to say that 
Relax is capable of binding to DNA on the target sites of bHLH factors, and that it 
can regulate transcription. 

1. Proteins of the bHLH family form homo- or heterodimers and bind 
specifically to DNA (Murre et al., 1989, Cell, 58, 537-44). 

The in vitro characterization of the DNA binding sites has allowed the 
identification of a consensus hexanucleotide sequence, CANNTG, called E box 
(Murre et al., 1989). Applicants have shown, by gel migration retardation 
experiments, that the purified recombinant Relax protein is capable of binding 
specifically to a sequence containing an E box. To do this, the DNA encoding the 
Relax protein is inserted between the NdeI and KpnI sites into a vector 
pFLAG-CTC. After expression of the protein in Escherichia coli, the fusion protein 
Relax-Flag is purified on an affinity column and then used in a binding reaction in the 
presence of an oligonucleotide containing an E box. 
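
By way of illustration only, the CANNTG consensus can be expressed as a pattern and searched for in a nucleotide sequence. The function name and the toy sequences are hypothetical; the two hexanucleotides embedded in them are the intact E box CAGGTG (SEQ ID NO: 6) and its mutated form TCCGTG (SEQ ID NO: 7) used in the reporter constructs of this Example.

```python
import re

# CANNTG, with N being any of the four bases. The lookahead allows
# overlapping occurrences to be reported.
E_BOX = re.compile(r"(?=(CA[ACGT]{2}TG))")


def find_e_boxes(sequence: str):
    """Return (0-based position, hexanucleotide) for every E box found."""
    return [(match.start(), match.group(1))
            for match in E_BOX.finditer(sequence.upper())]


# Toy flanking bases invented for illustration only.
print(find_e_boxes("TTCAGGTGAA"))   # contains the intact E box CAGGTG
print(find_e_boxes("TTTCCGTGAA"))   # contains the mutated site TCCGTG
```

The mutated site no longer begins with CA and therefore no longer satisfies the consensus, which is consistent with its use as a negative control.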

2. The transcriptional activity of the Relax protein, for its part, was tested by 
cotransfection experiments in the cell line PC12, derived from rat 
pheochromocytomas. 

For that, Applicants used a luciferase reporter gene placed under the 
control, on the one hand, of a promoter containing an E box (5 kb of the TH 
promoter) and, on the other hand, of the same promoter containing a mutated E 
box. 

In parallel, Applicants prepared a vector for expressing the Relax protein as 
follows: 

(a) The cDNA encoding the Relax protein (from position 389 to 1235) is 
inserted between the EcoRI and XhoI sites into the expression vector pcDNA3 
downstream of the CMV promoter. The rat TH promoter carrying an E box 
CAGGTG (SEQ ID NO: 6) at position 197 on a 5 kbp fragment is inserted into the 
blunt-ended HindIII site in the plasmid pKSLuc, which becomes 5kbTH-Luc. The 
site is mutated into TCCGTG (SEQ ID NO: 7) and the resulting plasmid 
designated 5kbTH(Delta E)-Luc. 

(b) The plasmid pcDNA3-Relax is cotransfected in various quantities 
ranging from 0 to 5 pmol, and brought to 5 pmol with empty vector pcDNA3, in 
PC12 cells (10^ cells/plate) mixed with 1 pmol of plasmid 5kbTH-Luc, 
5kbTH(Delta E)-Luc or pKSLuc. To evaluate the transfection efficiency, 0.2 pmol of a 
vector for expressing the gene encoding the CAT (chloramphenicol 
acetyltransferase) protein under the control of the RSV promoter is also 
cotransfected. 

Measurement of the luciferase activity, after cotransfection of these vectors 
with increasing quantities of a vector for expressing the Relax protein, 
demonstrates that Relax activates the transcription of the reporter gene depending 
on the quantity of expression vector used and that this activation strictly requires 
the presence of an intact E box. 
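
By way of illustration only, the composition of each cotransfection and the normalization of the luciferase signal by the CAT signal can be sketched as follows. The function names are hypothetical, the measurement values in the example are invented, and dividing luciferase activity by CAT activity is an assumed normalization; the description states only that CAT serves to evaluate the transfection efficiency.

```python
def transfection_mix(relax_pmol: float, total_expression_pmol: float = 5.0) -> dict:
    """Plasmid amounts (pmol) for one plate, following the quantities above.

    The Relax expression vector is completed to a constant total with
    empty pcDNA3 so that every plate receives the same amount of DNA.
    """
    if not 0 <= relax_pmol <= total_expression_pmol:
        raise ValueError("Relax vector amount must lie between 0 and the total")
    return {
        "pcDNA3-Relax": relax_pmol,
        "pcDNA3 (empty)": total_expression_pmol - relax_pmol,
        "luciferase reporter": 1.0,
        "RSV-CAT": 0.2,
    }


def normalized_activity(luciferase: float, cat: float) -> float:
    """Luciferase activity corrected for transfection efficiency (assumed ratio)."""
    if cat <= 0:
        raise ValueError("CAT activity must be positive")
    return luciferase / cat


print(transfection_mix(2.0))
print(normalized_activity(luciferase=500.0, cat=2.0))  # invented readings
```

Keeping the total amount of expression vector constant ensures that differences in reporter activity reflect the quantity of Relax vector rather than the quantity of transfected DNA.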

EXAMPLE 5: Characterization of the sites for expression of the Relax mRNA 
during embryonic development. 

Applicants systematically analysed, by in situ hybridization, the distribution 
of the Relax mRNA in rats during embryonic development. To do this, in situ 
hybridizations in the rat embryo are carried out according to the Wilkinson method 
with anti-sense riboprobes either in toto or on sections of 250 µm. The riboprobes 
are prepared in the presence of 1 µg of linearized plasmid and in the presence of 3.5 
nmol of digoxigenin-11-UTP using the Promega riboprobe kit. This detailed 
analysis shows that the expression of the Relax mRNA is transitory and spatially 
restricted. 

This mRNA is only present in the embryo, between days 11.5 and 18.5 of 
development. Its expression is restricted to the central nervous system. Indeed, no 
expression was detected, either in the peripheral nervous system, or outside of the 
nervous system. Inside the CNS, the Relax mRNA is expressed only in the spinal 
cord, the posterior brain and the anterior brain. 

More precisely, the Relax mRNA is expressed in longitudinal 
compartments in the spinal cord and the posterior brain. 

In the spinal cord, its distribution is restricted to two continuous 
bands of bilateral symmetry, longitudinally organized and restricted, on the D/V 
axis, to the basal lamina, one at its ventral end, the other at its dorsal end. 

In the anterior brain, the Relax mRNAs are located in two groups of cells 
situated on either side of the optic recess, in the ventral region of the hypothalamus 
and in the preoptic area. The predominant group of cells expressing Relax is 
situated in the basal part. Only the few cells of the preoptic area are situated in the 
wing region. The situation in the anterior brain is similar to that in the spinal cord 
and the posterior brain. Indeed, the cells expressing Relax are, in all cases, strictly 
located in the ventricular zone. 




EXAMPLE 6: Characterization of the timing of expression of the Relax mRNA 
during embryonic development. 

All the sites of expression of Relax appear simultaneously at E11.5. They 
disappear progressively from the spinal cord between E16.5 and E18.5, according 
to a ventrodorsal gradient that follows the progressive reduction of the ventricular 
zone. In the posterior brain, the expression of Relax switches off at the same time 
as that of the median band in the spinal cord. Finally, in the anterior brain, the 
expression of Relax disappears after E18.5. It is important to underline that the 
expression of Relax is confined to the ventricular zone and that the temporal 
window for expression coincides with that for the production of the neurons in 
these different structures. 



EXAMPLE 7: Isolation of a new cDNA encoding the human neurogenin 3 
(hngn3) homolog. 

Applicants have shown that the overexpression of Relax, the rat homolog 
of ngn3, in Xenopus embryo, induces the ectopic production of neurons 
throughout the ectoderm, thereby demonstrating that ngn3 has a neuronal 
determination function (Examples 1-6 supra and ^^). Here, Applicants report the 
isolation of the human neurogenin 3 homolog (hngn3). 

PCR primers: All PCR primers were chosen with the Oligo 4.0 software 
program. The nucleotide position of all the primers listed below is given relative 
to the genomic hngn3 sequence (SEQ ID NO: 9). 

ss-cDNA synthesis: Randomly primed ss-cDNAs were synthesized using total 
RNA extracted from the spinal cord of a 7-week-old human embryo. Total RNA 
was prepared with RNAble (Eurobio) based on a guanidine/phenol extraction 
procedure described by Chomczynski and Sacchi^^. The ss-cDNAs were 
synthesized from 2 µg of denatured total RNA in the following reaction mix: 0.4 
µM random hexamer, 2.5 mM dNTP, 4 mM MgCl2, 10 mM Tris, pH 9, 50 mM 
KCl, 0.1% Triton X-100, 0.01% gelatin, 10 mM DTT, 8 U RNasin and 2.4 U 
AMV RTase. The mixture was incubated at 42°C for 1 hour. 

Amplification of the hngn3 bHLH domain: PCR was performed using two 
specific primers deduced from the nucleotide sequences of Relax and ngn3 
corresponding to the basic domain: hngn3B (nucleotides 585-602) 
5'-CAACGACCGCGAGCGCAA-3' (SEQ ID NO: 11) and to the helix II domain: 
hngn3HII (nucleotides 722-699) 5'-GCCCAGATGTAGTTGTGGGCGAAG-3' 
(SEQ ID NO: 12). The PCR mix consisted of 10 mM Tris pH 9, 50 mM KCl, 
0.1% Triton X-100, 0.01% gelatin, 1.5 mM MgCl2, 0.8 µM of both primers, 200 
nM dNTP, 2.5 U Taq polymerase and one tenth of the previously synthesized 
ss-cDNAs. Amplification conditions were: 94°C for 3 min, 60°C for 2 min, 72°C 
for 30 sec, then 29 cycles of 94°C for 30 sec, 60°C for 45 sec, 72°C for 30 sec. 
The PCR products were subcloned using the pMOSBlue PCR cloning system 
(Amersham) and sequenced. 
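
The primer coordinates above determine the expected product size. The following sketch checks that each printed primer length agrees with its coordinate span and derives the amplicon length, assuming inclusive coordinates on SEQ ID NO: 9. The helper name is illustrative.

```python
def span_length(first: int, last: int) -> int:
    """Length in nucleotides of an inclusive span, in either orientation."""
    return abs(last - first) + 1


# (description, 5' coordinate, 3' coordinate, sequence as printed in the text)
forward = ("basic-domain primer, SEQ ID NO: 11", 585, 602,
           "CAACGACCGCGAGCGCAA")
reverse = ("helix II primer, SEQ ID NO: 12", 722, 699,
           "GCCCAGATGTAGTTGTGGGCGAAG")

for description, five_prime, three_prime, seq in (forward, reverse):
    # The printed primer length should equal its coordinate span.
    assert len(seq) == span_length(five_prime, three_prime), description

# The product runs from the 5' end of one primer to the 5' end of the other.
amplicon = span_length(forward[1], reverse[1])
print(amplicon)  # 138
```

The result agrees with the 138 bp bHLH fragment referred to in the following sections.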

3' anchored PCR: Randomly primed ss-cDNAs were synthesized as above with 
a different random primer (RA3'-NV; 5'-TGGTTGAGAGTGGTAGGAGGAGA 
GTGAGGAGAGAGAGTAGAGGGTAGTGGNNNNNNNNN-3'; SEQ ID NO: 
13) to make it possible to carry out the 3' extension experiment described by 
Ravassard et al.^28. Two specific nested primers were designed based upon the 
sequence of the 138 bp fragment corresponding to the bHLH domain of hngn3, 
hngn35'-1 (nucleotides 663-683): 5'-AGACGACGCGAAGCTCACCA-3' 
(SEQ ID NO: 14) and hngn35'-2 (nucleotides 675-693): 
5'-GCTCACCAAGATCGAGACGCTGCG-3' (SEQ ID NO: 15). Applicants 
obtained the 3'-extended cDNA by performing two rounds of nested PCR, one 
with A3'-1: 5'-ATCGTTGAGACTCGTACCAGCAGAG-3' (SEQ ID NO: 16) 
and hngn35'-1, and the other with A3'-2: 
5'-TCGTACCAGCAGAGTCACGAGAGAG-3' (SEQ ID NO: 17) and hngn35'-2. 
Applicants used the reaction mix described above. Amplification conditions 
were: 94°C for 3 min, 58°C for 45 sec, 72°C for 1 min 30 sec, then 34 cycles of 
94°C for 30 sec, 58°C for 45 sec, 72°C for 1 min 30 sec. The PCR products were 
subcloned using the pMOSBlue PCR cloning system and sequenced. 

5' anchored PCR or SLIC strategy (single stranded ligation of cDNA): Specific 
ss-cDNA was synthesized from human embryonic spinal cord RNA with an 
antisense primer based on the 3'-extended cDNA sequence: hngn3RT 
(nucleotides 863-845) 5'-CTGCCAGCCTGGGAGACTG-3' (SEQ ID NO: 18). 
Single-stranded DNA was ligated with the A5'-NV oligonucleotide: 5'- 
CTGCATCTATCTAATGCTCCTCTCGCTACCTGCTCA 
CTCTGCGTGACATC-3' (SEQ ID NO: 19) as described by Dumas Milne 
Edwards et al.^29. Two rounds of nested PCR were performed with the first SLIC 
primer A5'-1: 5'-GATGTCAGGGAGAGTGAGCAGGTAG-3' (SEQ ID NO: 
20) and the specific primer hngn33'-1 (nucleotides 858-836): 
5'-AGCCTGGGAGACTGGGGAGTAGA-3' (SEQ ID NO: 21), then with the 
second SLIC primer A5'-2: 5'-AGAGTGAGCAGGTAGCGAGAGGAG-3' 
(SEQ ID NO: 22) and the specific primer hngn33'-2 (nucleotides 747-726): 
5'-CGCTATGCGCAGCGTTTGAGTC-3' (SEQ ID NO: 23). Both sets of PCR 
conditions were identical to those used for 3'-anchored PCR. The PCR products 
were subcloned using the pGEM-T Easy PCR cloning system (Promega) and 
sequenced. 

One of the main expression sites for Relax/ngn3 is the developing spinal 
cord in rat and mouse^^. Applicants have isolated the human homolog of 
rodent ngn3 by producing a single-stranded cDNA (ss-cDNA) from mRNA 
extracted from the spinal cord of a 7-week-old human embryo. PCR 
amplification was performed on the ss-cDNA using primers based on the basic 
and helix II regions of the Relax/ngn3 sequences. A 138 bp product was 
amplified (nucleotides 585-722 of SEQ ID NO: 9; see Figure 3). The sequence 
of the cloned PCR product was 91.2% and 87.5% identical to the nucleotide 
sequences of ngn3 and Relax respectively. It was 89.1% identical to the ngn1 
and ngn2 sequences. Thus, the isolated PCR fragment is the human homolog of 
a neurogenin, most probably Relax/ngn3. 
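
Percent identity figures of this kind are computed over an alignment. The sketch below shows one common convention (matches divided by aligned, non-gap columns); the text does not state which convention or software was used, and the two sequences are toy strings, not the hngn3, ngn3 or Relax sequences.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a.upper(), b.upper())
               if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


# Toy aligned sequences (hypothetical): 9 of 10 columns match.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # 90.0
```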

To obtain the full-length coding sequence corresponding to the 138 bp 
PCR fragment, 3'-anchored PCR on human embryonic spinal cord ss-cDNA was 
performed. A 655 bp PCR fragment was isolated, corresponding to a 3' 
extension of the original cDNA clone (nucleotides 675-1330 of SEQ ID NO: 9; 
see Figure 3). Open reading frame analysis showed that the entire C-terminal 
region (nucleotides 675-963 of SEQ ID NO: 9) and 366 nucleotides of the 3' 
untranslated region (nucleotides 964-1330 of SEQ ID NO: 9) had been cloned 
(Figure 3). Applicants obtained the 5' end of this cDNA by performing 5'- 
anchored PCR by the SLIC method. Applicants amplified a 747 bp PCR 
fragment (nucleotides 1-747 of SEQ ID NO: 9) that overlapped the 3'-extended 
cDNA clone over 73 bp (Figure 3). The two clones were obtained by 
independent PCR, so it was necessary to demonstrate that the contiguous 
sequence corresponded to a real mRNA and not to a chimera. Applicants 
demonstrated by reverse transcription-mediated PCR using PCR primers of SEQ 
ID NO: 24 and SEQ ID NO: 25 that a 1166 bp transcript is present in the 
human embryonic spinal cord, the sequence of which indeed corresponded to the 
alignment of the 3' and 5' fragments (nucleotides 1-157 and nucleotides 321- 
1330 of SEQ ID NO: 9). Nucleotides 158-320 of SEQ ID NO: 9 comprise a 5'- 
UTR interrupting intron identified by amplification performed on genomic DNA 
described below. Applicants completely sequenced both strands of four 
independent PCR products to rule out mutations induced by PCR. The 
nucleotide sequence was analyzed and the cloned cDNA was found to belong to 
the neurogenin family, and to be most closely related to Relax/ngn3. Applicants 
therefore named this cDNA human neurogenin 3 (hngn3). 
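
The overlap between the two anchored-PCR clones follows directly from their coordinates on SEQ ID NO: 9. A minimal sketch, assuming inclusive coordinates; the function and variable names are illustrative.

```python
def inclusive_length(start: int, end: int) -> int:
    """Number of positions in the inclusive interval [start, end]."""
    return end - start + 1


def overlap(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


five_prime_clone = (1, 747)      # 5'-anchored (SLIC) fragment
three_prime_clone = (675, 1330)  # 3'-extended fragment
intron = (158, 320)              # intron interrupting the 5' UTR

print(overlap(five_prime_clone, three_prime_clone))  # 73
print(inclusive_length(*intron))                     # 163
```

The computed overlap matches the 73 bp stated above, and positions 158-320 span 163 nucleotides.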

The hngn3 cDNA comprises an open reading frame downstream from an 
in-frame methionine (Figure 3). This hngn3 open reading frame is predicted to 
encode a 214 amino acid polypeptide (SEQ ID NO: 10). Alignment of the 
predicted amino-acid sequence encoded by hngn3 with the sequences of the 
mouse neurogenins, Relax (rat ngn3) and the human ngn1 homolog (neuroD3) 
showed that most of the residues of the hngn3 protein were conserved in its 
murine counterparts. Indeed, hngn3 shares 78.0% and 77.1% identity with Relax 
and ngn3 respectively whereas only 48%, 46%, and 44% identity is found for 
neuroD3, ngn1 and ngn2 respectively (Figure 4). 

The genomic sequences of the three mouse neurogenin genes and that of the 
Mash1 gene have been analyzed and, in each case, the entire coding sequence has 
been shown to be contained within a single exon^18,20. Applicants therefore 
analyzed the structure of the hngn3 gene by PCR using the set of primers (SEQ ID 
NO: 24 and SEQ ID NO: 25) that yielded the 1166 bp fragment in the 
amplification of spinal cord cDNA. Interestingly, the PCR product obtained from 
genomic DNA was 1330 bp long and comprised the polynucleotide sequence as 
depicted in SEQ ID NO: 9. Sequence analysis showed that the coding sequence 
was contained within a single exon and that there was a 163 bp intron interrupting 
the 5' untranslated region (nucleotides 158-320 of SEQ ID NO: 9; see Figure 
3). The hngn3 nucleic acid and hngn3 polypeptide sequence information has 
been deposited within the GenBank/EMBL database (Accession number: 
AJ133776). 

EXAMPLE 8: Phylogenetic analysis of the hngn3 related sequences. 

In this Example, Applicants carried out phylogenetic analysis of the 
relationships between the bHLH factors related to hngn3 and found that the 
neurogenin genes were homologous to the Drosophila Tap gene and did not 
cluster with the atonal family. This study also shows that a great number of 
bHLH genes involved in neurogenesis are clearly misnamed. 

Optimal alignments of the sequences of the bHLH domains of the proteins 
most closely related to the protein encoded by our putative human neurogenin3 
clone were subjected to phylogenetic analysis with distance methods. The tree 
obtained was rooted on the Drosophila Daughterless protein sequence (Figure 
5a). Five groups were consistently identified, the branching being well supported 
by the bootstrap values. The vertebrate neurogenin group has the product of the 
Drosophila tap gene as homolog and is most closely related to the neuroD 
group, for which no invertebrate homolog has yet been identified. The atonal- 
related group comprises the nematode lin-32 protein and all the ath1 proteins. 
These three groups are more distantly related to other sequences of a group 
consisting of the twist, scleraxis, paraxis, E-hand and D-hand sequences and to 
the group of the achaete-scute complex. Alignments of whole sequences could 
only be used for the neurogenin-related sequences (Figure 5b), and made it 
possible to depict three paralogous groups in vertebrates, corresponding to ngn1, 
ngn2 and ngn3 in the mouse. According to this classification, the human gene 
designated neuroD3 is in fact orthologous to the other ngn1 vertebrate 
sequences, and the Xenopus xngnr1a gene is orthologous to ngn2. 
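
Distance methods start from a matrix of pairwise distances between aligned sequences. The text does not say which distance measure or program was used, so the sketch below uses the simple p-distance (fraction of differing aligned positions) as an assumption, applied to three hypothetical toy sequences rather than real bHLH domains.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Fraction of aligned, non-gap positions at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable columns")
    return sum(x != y for x, y in columns) / len(columns)


def distance_matrix(aligned: dict) -> dict:
    """Symmetric matrix of pairwise p-distances, as nested dictionaries."""
    names = list(aligned)
    matrix = {name: {name: 0.0} for name in names}
    for a, b in combinations(names, 2):
        d = p_distance(aligned[a], aligned[b])
        matrix[a][b] = d
        matrix[b][a] = d
    return matrix


# Hypothetical toy alignment, for illustration only.
toy = {
    "seqA": "KRRLAANARERR",
    "seqB": "KRRLAANARERN",
    "seqC": "KRRVKANDRERN",
}
for name, row in distance_matrix(toy).items():
    print(name, {other: round(d, 3) for other, d in row.items()})
```

A tree-building step (for example neighbor joining) and bootstrap resampling of alignment columns would then operate on matrices of this kind.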

A few key conclusions may be drawn from the phylogenetic analysis. First, 
various paralogous groups of bHLH proteins were unambiguously defined, 
thereby helping to clarify the relationships within this transcription factor family 
(Figure 4a). All groups except neuroD contained vertebrate and Bilateria 
representatives (atonal and ath1 proteins, tap and ngn factors, AS-C-related 
proteins, twist-related factors). Drosophila atonal is commonly thought to be the 
archetype of this group, but the phylogenetic analysis does not support this 
contention. Atonal does belong to the ath1 group, but it is only distantly related 
to the neuroD and tap/ngn groups. It is not clear whether neuroD really has no 
Drosophila homolog or whether such a homolog has not yet been found. Four of 
these groups correspond to proneural or neural specification genes, and it may be 
that the ancestor genes of this group also had a proneural function that would 
impose some constraints on the structure-function relationships of the related 
proteins. Finally, three paralogous proteins belong to the vertebrate ngn 
subfamily (ngn1, ngn2, and ngn3), although only one is found in metazoans 
which diverged much earlier during evolution, such as Drosophila. This situation 
is reminiscent of what has been observed for many other families of genes 
expressed in the CNS, including homeobox proteins, growth factors, enzymes or 
neurotransmitter receptors^21,22. Thus, as in the other gene families, the vertebrate 
ngn paralogous genes may have been generated by two gene duplication steps 
that occurred before the emergence of cartilaginous fish and, therefore, all 
vertebrate groups of species may have at least three representatives of the ngn 
subfamily of bHLH transcription factors. Finally, a phylogenetic approach 
should make it possible to unify the system of nomenclature of these proteins, 
some being clearly misnamed, such as Xenopus ngnr1a, human neuroD3, and all 
the ath proteins (ath2 to 5) with the exception of the ath1 family. 

EXAMPLE 9: Chromosome mapping of the hngn3 gene. 

Applicants used radiation hybrid mapping to map the hngn3 gene to 
chromosome 10q21.3, with a linkage placement confidence of "lod score>15". 

Applicants determined the chromosomal location of the hngn3 gene by screening 
by PCR a panel of human/hamster radiation hybrids (Genebridge 4). Applicants 
first checked that the PCR conditions resulted in the amplification of a single 
1.3 kbp fragment from human genomic DNA whereas no PCR product was 
obtained from hamster genomic DNA. Applicants then amplified the 1.3 kbp 
fragment from the genomic DNA of the 93 radiation hybrids. Applicants 
checked the specificity of the amplification by Southern blot analysis of the 
resulting PCR products using an internal hngn3 probe. 

All PCR primers were chosen with the Oligo 4.0 software program. The 
nucleotide position of all the primers listed below and the hybridization probe 
are given relative to the genomic hngn3 sequence (SEQ ID NO: 9). The 
following primers were used to screen by PCR the Genebridge 4 radiation hybrid 
panel: hngn3S (nucleotides 1-25): 5'-GCTGGGAGGGGATTGTGTGTTGTTT-3' 
(SEQ ID NO: 24); hngn3AS (nucleotides 1330-1307): 
5'-TGAGTGAGGGTAGGGCGACCCAAG-3' (SEQ ID NO: 25). 

PCR was performed with 25 ng of genomic DNA from each radiation 
hybrid in a final reaction volume of 25 µl containing 0.8 µM of each primer, 200 
µM dNTP, 1.5 mM MgCl2, 10 mM Tris, pH 9, 50 mM KCl, 0.1% Triton X-100, 
0.01% gelatin and 2 U Taq polymerase. Amplification conditions were: 94°C for 
3 min, 60°C for 2 min, 72°C for 1 min 30 sec, then 29 cycles of 94°C for 30 sec, 
60°C for 45 sec, 72°C for 1 min 30 sec. 
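
The final concentrations above translate into absolute amounts per reaction by a simple unit identity (1 µM equals 1 pmol per µl). A minimal sketch, with concentrations read as µM and the 25 µl volume taken from the text; the function name is illustrative.

```python
def amount_pmol(concentration_uM: float, volume_ul: float) -> float:
    """Amount in pmol: µM multiplied by µl, since 1 µM is 1 pmol per µl."""
    return concentration_uM * volume_ul


VOLUME_UL = 25.0  # final reaction volume

final_concentrations_uM = {
    "each primer": 0.8,
    "dNTP": 200.0,
    "MgCl2": 1500.0,  # 1.5 mM expressed in µM
}

for reagent, concentration in final_concentrations_uM.items():
    print(f"{reagent}: {amount_pmol(concentration, VOLUME_UL):.1f} pmol")
```

Each 25 µl reaction therefore contains 20 pmol of each primer under these conditions.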

The PCR products were loaded onto a 1% agarose gel and subjected to 
electrophoresis. The DNA was then transferred to a Hybond-N+ membrane 
(Amersham) under alkaline conditions and probed under standard conditions 
with a 32P-labeled hngn3 internal probe: (nucleotides 496-510) 5'- 
AGGAAGCTCCGGGCA-3' (SEQ ID NO: 26). The membrane was washed 
several times and autoradiographed. A specific 1.3 kbp fragment was detected in 
24 radiation hybrids of the 93 hybrids in the panel. The PCR results were 
processed using the Whitehead Institute Radiation Hybrid Mapping software, 
allowing only lod scores > 15 as minimal placement confidence. 
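
The screening result can be summarized as a retention frequency, the fraction of hybrids that carry the marker. In the sketch below the score vector is hypothetical: the text reports 24 positives among 93 hybrids but not which hybrids were positive, and the 0/1/2 coding (absent, present, ambiguous) is assumed for illustration.

```python
def retention_frequency(scores: list) -> float:
    """Fraction of unambiguously typed hybrids (0 or 1) that scored positive."""
    typed = [s for s in scores if s in (0, 1)]
    if not typed:
        raise ValueError("no unambiguous scores")
    return sum(typed) / len(typed)


# Hypothetical ordering: 24 positive and 69 negative hybrids.
scores = [1] * 24 + [0] * 69

print(f"{retention_frequency(scores):.3f}")  # 0.258
```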

Precise chromosomal mapping was performed by linkage analysis using 
the Whitehead Institute radiation hybrid mapper. Applicants mapped the hngn3 
gene to chromosome 10 at 2.74 cR from D10S1606 (WI-5300) and 2.53 cR from 
D10S1418 (CHLC.GATA101E02) with a placement confidence of "lod 
score>15". Both markers have been mapped to 10q21.3 and according to the 
Whitehead Institute radiation hybrid map of chromosome 10 they are 3 cR apart. 
Thus, the hngn3 gene maps to the region between the D10S1606 and D10S1418 
markers on the long arm of chromosome 10, and more precisely in position 
10q21.3 (Figure 6). 

The hngn3 gene mapped to the long arm of chromosome 10, more 
precisely to 10q21.3. hngn3 maps to the region encompassing annexin VII 
(ANX7) to CaM kinase II gamma (CAMKG) and also to that encompassing 
ankyrin 3 (ANK3) to prosaposin (PSAP). The latter genes have been mapped in 
either human or mouse. In mouse, the ANX7 and CAMKG genes have been 
mapped to chromosome 14 whereas the ANK3 and PSAP genes have 
been mapped to chromosome 10. Thus, the ngn3 gene is probably located in 
mouse either on chromosome 10 or 14. 

EXAMPLE 10: In situ localization of hngn3 expression in the developing 
human embryo. 

In the developing human embryo, Applicants showed by in situ 
hybridization that the hngn3 gene is expressed in the spinal cord and pancreas. 
Furthermore, the expression pattern of hngn3 was identical to that of ngn3 
genes in rat and mouse embryos, suggesting that the function of ngn3 genes is 
conserved in mammals. 

Human embryos were obtained from legal abortions (Hopital R. Debre, 
Professors P. Blot and J. F. Oury, Paris) according to the guidelines of the 
French Ethical Committee. The embryos were aged from 6 to 7 weeks post- 
conception as determined by crown-to-rump length measured by ultrasound 
scanning and the Carnegie stage was evaluated according to the general 
morphology. Embryos were fixed in 4% paraformaldehyde overnight, 
cryoprotected 48 hours in 1X phosphate-buffered saline (PBS) containing 15% 
sucrose, embedded in 7% gelatin prepared in 1X PBS with 15% sucrose and 
frozen in isopentane. 

Serial transverse sections of 14 µm were cut and probed with either 
the antisense (SEQ ID NO: 27) or the sense (SEQ ID NO: 28) digoxigenin-11- 
UTP labeled hngn3 riboprobes according to the modified whole mount in situ 
hybridization protocol described by Wilkinson^^. Negative controls with sense 
probes did not show any signals. 

Transverse sections through human embryos at Carnegie stage 18 
(six and a half weeks) were probed with an hngn3 antisense riboprobe. No signal 
was detected in the negative controls, probed with sense riboprobes (Figure 7b). 
hngn3-positive cells were found in the spinal cord and in the pancreas (Figure 
7a, 7c). As previously described for ngn3 in the mouse embryo^^, hngn3 is 
expressed in scattered cells throughout the pancreas (Figure 7d). These cells 
correspond, in the mouse embryo, to the precursors of the islets of Langerhans^18. 
hngn3 was also expressed in the ventral region of the spinal cord (Figure 7a, 7e- 
h), in two longitudinal stripes located at two dorso-ventral positions. The first 
one is strictly adjacent to the floor plate (Figure 7a, 7e and 7g) and the second, 
with a weaker intensity of staining, is located around the midpoint between the 
floor plate and the roof plate (Figure 7a, 7e and 7h). Phase contrast microscopy 
clearly demonstrated the presence of hngn3-expressing cells inside the 
ventricular zone of the neural tube (Figure 7f). In the most ventral stripe, hngn3- 
positive cells are present near the lumen of the neural tube and in the lateral part 
of the ventricular zone (Figure 7g). In the medial stripe, most of the hngn3- 
expressing cells are located near the marginal border of the ventricular zone 
(Figure 7h). This pattern is very similar to that for Relax and ngn3 in the spinal 
cord of developing rat and mouse embryos^^. Relax and ngn3 are also 
expressed in the hindbrain and in the ventral region of the forebrain close to the 
optic recess^^. Due to limitations inherent in the use of human embryos, all the 
samples collected were extensively damaged at the anterior part. Thus, 
histological analysis was not possible for the hindbrain or the forebrain. 
Nevertheless, Applicants found that hngn3 was expressed in the same 
compartments as Relax and ngn3 in the pancreas and the spinal cord. Our results 
suggest a conservation of function for the neurogenin 3 genes of humans, mice 
and rats. 

The expression pattern of hngn3 in the spinal cord is identical to that of 
Relax in rats and ngn3 in mice^^. In the pancreas, hngn3-expressing cells 
were found in clusters similar to those described in the mouse pancreas^18. 
Several lines of evidence support the idea that ngn3 governs the differentiation 
of the endocrine pancreas. Firstly, Sommer et al.^18 have shown that the 
expression of ngn3 and neuroD is coordinated both in space and time, suggesting 
that ngn3 controls neuroD expression. Secondly, mice carrying a null mutation 
in the neuroD gene have severe diabetes^23. Mutant mice have far fewer insulin- 
producing β-cells and do not develop mature islets. Finally, it is most likely that 
ngn3 initiates the differentiation of pancreatic β-cells by controlling the 
expression of neuroD and the subsequent production of insulin^^. More recent 
data have demonstrated that ngn3 is the gene that controls the entire 
differentiation of the endocrine pancreas. Analysis of ngn3 mutant mice has 
shown that these mice lack the four pancreatic endocrine cell types (including 
the insulin-producing cells). In addition, it has been demonstrated that the 
overexpression of ngn3 in the developing pancreas yields an increased number 
of endocrine cells^32. Thus, any mutations in the hngn3 gene could lead to severe 
diabetes. The hngn3 gene has been mapped to the 10q21.3 position. In this 
region, no genes for inheritable diseases with phenotypes consistent with a 
hngn3-altered function have been identified. The numerous studies performed on 
the genetic predisposition to type 1 diabetes mellitus have shown polygenic 
inheritance of the disease with a major locus at the major histocompatibility 
complex (MHC)^^. The various susceptibility loci outside the MHC locus 
remain to be identified. hngn3 is a strong candidate gene for the yet unmapped 
genes for insulin-dependent diabetes. It may also be involved in other human 
diseases. Rostomily et al.^26 have shown that the human neurogenin 1 homolog 
gene, NeuroD3, is expressed in primitive neuroectodermal tumors, especially in 
those that are highly metastatic. Thus, the analysis of hngn3 expression in 
neuroectodermal tumors and in pancreatic β-cell tumors, such as insulinomas, 
may indicate whether the dysregulation of the hngn3 gene is involved in 
malignant transformation. 

The human homolog of neurogenin 3 (hngn3) from human embryonic 
spinal cord has now been identified by Applicants. Phylogenetic analysis 
showed that the neurogenin genes are homologous to the Drosophila Tap gene 
and that the atonal gene is certainly not the archetype of this gene family as was 
previously suggested. A radiation hybrid panel was screened, leading to the 
mapping of the hngn3 gene to the 10q21.3 position. Early human embryos were 
used to investigate the pattern of hngn3 expression. Most importantly, the cells 
expressing hngn3 were located in the same compartments of the spinal cord and 
pancreas as those expressing ngn3 genes in rats and mice. Thus, these data 
support the idea of a functional conservation among the neurogenin 3 genes. 

The structure of the various neurogenin genes appears to be conserved, 
with, as described here for the hngn3 gene, the entire coding sequence contained 
within a single exon. Similar genomic organizations have been reported for the 
other three mouse ngn genes, the Zebrafish ngn1 gene and the mouse Mash1 
gene^^. The identification of a single intron interrupting the 5' untranslated 
region of the hngn3 gene is similar to the organization of two other 
members of the ngn family, the mouse ngn2 and Zebrafish ngn1 genes^^. The 
structures of the other known neurogenins have not yet been fully analyzed. 
However, the presence of an intron within the 5'UTR and the absence of introns 
within the coding region may be common features of neurogenin genes. 

ABBREVIATIONS 

bp: base pairs, kbp: kilo base pairs, cR: centiRays 